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Abstract 


The author has identified the frllcmng significant results. 


ihis Final Report deocrlbes an objective system for regional is ation using 
the ERTS-1 (or LANDSAf) computer compatible tapes. A riuige of computer programs 
for {malysis of these tapes has been developed and these are described. Although 
some work has been carried out using Individual pixels, the en^hasls is on a 
level of generalization appropriate to a aatelllte system with repetitive global 
coverage. Work on establishing the main variables Influencing reglpnalisation 
is Incomplete, but the major variables ore land/wStolf^Wtlos , and vegetation 
cover. The scale or texture of the pattern of change in these Vafiables varies 
a good deal across the earth's surface, and it seems best if the unit of 
generalization adopted varies in sympathy with the surface being analysed. 

Go^uter generallzatipn and analysis cannot keep pace with data collection without 
^reasonable investment in machine power, bac It offers a degree of data reductlGr^ 
which Is highly attractive. 






OONTENIS. 


0. 

General ^qnraisal 

1 

0.1 

Resetu^ finance 

1 

0.2 

History and concept 

1 

0.3 

Conclusion 

4 

0.4 

Form of this report 

4 

1. 

ih^oduction 

S 

1.1 

Objectives 

S 

1.2 

GeaeraLisation of the ijnageiy 

6 

1.3 

Generalisation loiits 

6 

1.4 

Method of generalisation 

7 

1.5 

Data base for this investigation 

7 

2. 

inA locatioi and con^ilation 

9 

2.1 

Introduction 

9 

2.2 

Techniques for conftiling the UTAs 

9 

2.3 

Results 

10 

2.4 

Discussion 

10 

2.5 

T 

Display teclmiques 

12 
1 X 

V* 

3.1 

vrS|/ SolllplLlllg 

^hn^roduction 

13 

£t 

3.3 

AJiw b'A' CUlAiAiwJL iLwmi»'bAma 

The effects of the non^ipiform pri 

Dbabilit] 


on the histograi 

h 15 

3.4 

IdCTtification of scanner irregularities 

17 

3.5 

Discussion 

19 


4. The dialysis of p^-seale histograiK 21 


. ^tfoduction 21 

TeGhnique adopts 22 

Brief outline of t^e maxiimln likelihood 



method 

23 

4.4 

application to the probl^ 

25 

4.5 

Residts 

26 

4.6 

Discussion 

27 

5. 

The t£3e of clustering tecl^ques 

29 

6. 

Edge detection 

30 

6.1 

6.2 

Introduction 

Tedauque 

30 

30 

6.3 

Results 

51 


6*4 Discu$sion 32 





CXVIIENTS (contiraied) 


7* Gon$uter programs 34 

7.1 ^ Sub<^routines 34 

7.2 ‘ Gcnpiter program ava.ilable at 

Caii^ridge, U.K. 34 

7.3 Siq^rt programs at UEA 37 

Publications referred to m sections 0-7 39 

Figures referred to iii sections 0^7 4Q 

/{ipcndix: Piblications of the project $& 





0. General Appraisal 


RgPBODUGIBiLITY OF 18B 

SgSqinal page is POOR 


6,1 Resear.ch_Floao_ce 

The research carried out for two years from September 1972 was 
financed by the United Kingdom Department of Trade 4 Industry. 
Since October 1973 a further Research Associate has been 
Working on the material: He is funded by a Research Grant from 

the UK Naturai Environment Research Council. 


0. 2 History and concept 

In Common with Other investigators, the priginal proposal was drawn 
up well in advance of the launch of ERTS-^1, and without access to 
. any Simulated imagery or Gomputer=eoropatible material* We were 
aware 1) that it would be pointless to duplicate existing United 
States work in the use of multispectral imagery and felt 2) that 
too much attention was being paid to interpretation (by whatever 
teehnigue) at the limits of resolution of the system. Conversely 
it seemed that there might be room for a United Kingdom contribution 
that touGhed on two other aspects of satellite imagery, viz ; 1) the 
eonsiderable data^handling problem, Suggesting that a Computer^ 
Gompatibie approach would be of value, and 2) the unigue opportunity 
offered to aGhieve global or near-global coverage on a consistent 
basis. We were also aware Of the potential for monitoring Change 
provided for Gloud-free areas by the 18'^day repetition of imagery. 
This did not form a ma:gor focus of our original proposal, but was 
borne in mind in GGnsidering approaches that might reduee tho amount 
of data reguiring detailed interpretation. 


sguare, areas and the extraction Of generalized information from 
these areas* fhey were designated UTA's (Unit farget Areas) and 
were seen as the basic building block of a global system for 
generalization and the monitoring of major change in Successive 
images* With little more to guide US than a 'feel' for an 
appropriate scale, we proposed that initialTy the UTA should be 
56 X 56 km. It was argUed that. Since botaniGai .distributions 
based on a 16 km grid provided very Satisfactory maps for the 
British Isles at about 1:5 qoq OOO, continent^wide or global maps 
at 1:26 666 600 or smaller could well be based on a UTA of 
50 X 56 km. We further argued 1) that the wish to eSeape 
recognition Of such Gonventional elements of air photo reeognitton 
as towns, fields, ridges and valleys (as opposed to agrieultural 
areas, mountain ranges Or deserts) requirod a reasonably large UTA; 








while 2) ahtlGipated problems of precise location of UTA’s suggested 
that smaller areas would suffer from mose 'noise' due to inaccurate 
replication Of the area on successive passes; 3) finally, a larger 
UTA reduced the size of the data by the largest factor. 

An early decision involved our choice of computer on which to 
develop the System. The local university machine is an IGL and had 
the particular disadvantags that it reguired 7—track tapes. Although 
these were offered by NASA, it seemed that the 9^ track tapes might 
be easier to handle. Additionally, it seemed sensible to establish 
a system on a machine readily available elsewhere, particularly in 
the USA. As we have aceess to a 'regional' computer the IBM 
370/165 at the University of Cambridge, and a terminal has been 
installed at the university of East Anglia during this contract, 
it seemed the Obvious choice, we have suffered problems of slow 
turn roimd, particuiariy in the early stages of the program 
development when the terminal was not instalied. Nevertheless, it 
was the right chOiOe. 

Like other investigators, whether at home or abroad, we made a sic>w 
start due to the late arrival of erts-^ 1 products, particularly the 
computer tapes. Since not every^lng can be pursued at once, we 
concentrated On program development to exttaot gray- scale data for 
^A's from the tapes, once histograms were available, we spent part 
of Our effort examining ways of handling th®m. All throuifh ^is 
stage (which lasted ©ver a year) we in general assumed that - 
1) the tapes matched the images we had of the same scenes? 2) that 
the geographical coordinates on the tap®® were accurate within 
1“2 xm and so were not likely to distort our UTA histograms, and 
3) that the histograms produced by summation of the ntrnber of 
occurence of each value across the entire 128 points of the scale 
were a valid basis for analysis and eomparison of sucGessive images. 

As our technigues were developed it became cl ear that these 
assumptions were not wholly justified. One disappointing feature 
of research at this stage was the discovery that the slow speed 
at which results from other investigators were becoming known t© us 
Would force us to examine some guite .elementary aspects of the 
digital tapes. We had hoped to be able to build On results Of other, 
larger , teams , but this was rarely possible, partly beGause so few 


seemed cpnGerned with analysis^ rathe? than interpretation, of the 
digital tapes. 

Once it became clear that problems of location and of histogram 
characteri sties would require investigation, much of the effort we 
Were devoting to generalization had to g© baok to the individual 
pixel values. From this we were able to map by line-printer such 
features as fields, reservoirs and rivers in our Central Californian 
scene. From this we learnt that the photo images and the tapes did 
not GOincide in coverage, while we also noted errors in position of 
up to 1© km using the latitude, longitude grid as shown round the 
edge of the seene and recorded on the Computer tape. As these 
; errors are large compared With the size of ©^A we propose to use» 
it has become necessary to adjust the location information for 
each pass so that our UfA’s may be accurately located. Our reports 
have also noted progreis in smoothing the histogram valued to 
provide a satisfactory basis for the comparison of successive scenes 
of the same area, or different parts of the Same image. 

in the generalization of information fro® erts images, one approaGh 
is to map, by percentage area or other suitable measure ,• such 
simple variables as water* or snow and ice. An alternative approach 
we have been trying uses arbitrary square UTA’s. The gray-scale 
histograms Of these are compiled and they are then clustered using 
a ciuster^analysis program. These are indications that the major 
regional groupings seen on ERfS imagery may fall's^out from such an 
analysis without the need to use a eontiguity cGnstraint# if so 
(and we are still investigating the effect of varying the areas used 
as the building blocks Of this process) then this Will be a most 
satlsiactory ©b ject^ ve Classification technique for mapping at a 
small scale. 


ReglGnalization at a global or Continental scale is at present likely 
to be based rather subjectively ©n elimate^^^j^egetation and relied* 
Since vegetation is in large part dependent on climate (including 
Climate as modified by relief) the relatiGnShips are complex and 
the relative weight of the three variables varied from one person 
to another, and from one area to another. An objective classification 
based on statistical analysis of gray^scale values could provide a 
welcome alternative approach to geographical problems of global/ 
continental regionalization . 


4 

In developing this approach we have been aware that there are other 
visual charaeteristics of the earth's surface that would be relevant 
to our classification. Two of the most likely variables are the 
presence of rectangular or other checkerboard patterns which are most 
likely to result from agriculture i - and the texture of ridge and 
valley patterns in mountainous or upland areas. The first are most 
resdily derived from edge-detection technigues while the latter 
can be more elusive r although where they are well marked, we have 
been able to detect them by searching for * edges ' . Fourier 
transformation may also be of value, although rather slow. Edge 
detection will also be effective in picking up lake and snow edges 
* in areas where these occur. However, summation of approximate water 
areas by count of very low gray-scale Values is probably more 
appropriate for the high degree of generalization we seek. 

0.3 Cencluaion 

In GoneluSion we are convinced that too little attention is being 
paid to the value of ERTS imagery as a consistent data-base for 
global/eontinental generalization (regionalization and mapping) . 

If the information can be analysed in an objective manner , the 
ecmdyination of consistent data and replicable analysis will be 
a notable addition to the information we have ©n global 
distributions and patterns. 

0.4 Form of this Report 

The remainder of this Final Report to NASA consists of material drawn 
from earlier Brogress Reports , updated where necessary. In this way ^ 
the general points made in this General Appraisal and the Introduction 
are developed in more detail. The main publications of the research 
team are also included. The delay in submission is regretted . It is 
largely the result of repeated delays in commissioning a printer/ 
plotter; it had been hoped to demonstrate that this represented a 
cheap solution for graphic output that Would match the computing 
speed of the IBM installation. The delays here have been coneerned 
with the completion of hardware and have been wholly outside the 
control of the team. 
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1. INTRODUCTION 1 

1.1 QbJectivBii / ? 

.? 

Th» ala of Ihis project is to detlse a eystn for «enera34siiig | 

rsBotoly sensed data for aress of the earth's stores at a scale | 

•»proprlate to a world data eolleetion systan. Snccesefttl eensralisatioa 1 

?S‘ 

will allow the identifloatton ot ohahses to surface cower o er I 

suoeesslwe orhits, and naj he used for snail scale aapptog of Ihese I 

<dianges. 

®his syatan^ if satisfactory, could be used as a ^st stage I 

data filter, seleettog for further analysis those ioages which show I 

sljpillcant changes from earlier imagery, it is desirable that the 
system uses data which has \mdergone tto Btotoun aaouat of 

processtog, and hence bUMc (system corrected) lass toagery has beoi I 

used throughout* I 

Stoce it is Hfcely that any operational eartih obserwation satellite 
will, like HITS^, return the images as a digitized picture, there 
are many advantages in devlstog a system capable of processing this ^ 

»ew digital data* Among these advantages are the reduction to alow 
end costly photographic processtog. the potential for developing | 

a fully autototic eystem, and the abiiiiy to make use of the full I 

— - • Ir 

dynamic range of toe eensore without toe degradation to radiometric 1 

fidelity introduced by photographic proceseingr"li|8o futtro eystems | 

wiU possibly have sensors toich do not yield information suitable fbr 
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1*2 Qwwfalimtion ot «i# 

iMh BISS firuo ocntaina 3 x 10*^ pixftls (jdetarx •IwMts) tot 
•ttia flour MSS bands, idisross a rsglonsX nap at a seals of, say, 1 t 10^ 
could show 10 » 10^ indopandsnt data points In an equivalent area* 

It is thus obvious that sons generalisatioa of tbs SIS laageiy is 
required for regional scale investigations* 

*his enesalisatlon can be introduced an tao ways. She ecnveiitional 
approach is to classify the iaagery into surface ^es, and to use 
these surface types as a basis for generaisaticn. One disadvantage 
of this approach is the expense in eonputiag powsr required to olassify 
each p^ei quite apart fron the difficulty of recoenlainff 'standard 
types* for such classificaticn* in alternative systea is to 
generalise the rav data prior to ClasaificatsLon and napping cn a re^onal 
eeale. Ihis second approach is adopted for this proieot, sinoe it offers 
ab» capabmty of efficiently aoMtordng tl» Saagery on a regional scale. 

She types of feature idtieh should be significant on this scale 
are, for exwaplS(> large grass bunas in savanna regions, or snoir cover 
in ms catbtoent area of a reservoir, it is hoped that the systen 
described here co^ Identify this type of change against a background 
noiso of saaller scale changes. 

1.3 (leneralisatlm 

One of the aims of the project is to dsteiniae the area over 
iMCh the data be effectively generalised and yet still give a useful 
representation of the iaage. It is probable that this area idll be 
a fimction o£ the teawaln type and the features of interest* 


c 
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Xnltiall 7 » square Unit Tsst Areas (UTAs) oorsriaf 2.5 x 10^ 

heotaxss have bssm sae<l« These UTAs aare AefinedL hr tiie ^eogrephleal 

♦ 

eoorUaates of thedr eeatre points# and have edees parallel to 
lines of latitiide and longitude* Im some cases it ai|iit be desirable 
to use a Wk oMndding with geocpraiiiieal bomdaries# and to allow 
for this TJTAs wiihi polygcmal outlines say also be used* 

1.4 Method-of Generallaation C 

The generalisation nethod eliiOh has been adopted Oharaotdises 
each tJTA by its ffray scale frequency ddstribotion. The paraaeters 
used for generalisation could be quantities dedwed IToii those 
distributions, suOh as the mean and the standard devlationt or they 
could be the histogran that represents that disixibtttion* The latter 
be used in investigation , because me gray scale histograms 
are the simplest way of handling the infomation contained in the 
frequmicy distrihutioa* fbr moat purposes the four KSS bands will 
be toe ated independently, but greater senaitivlty couli be aohieted by 
using a frequency distribution in four dimensional measurement space. 

More complex methods of generalisation such as the power epectrum 
of the structure in the image could ^ve parameters related to the 
photogranetrio use of texture* The uee of such alternative means 
of genmalisation will be investigated at a later stage of the project 

1.5 Data-Base jbr this- InvOatlgatlOtt 
Two test sites have heen ehpaen, although adequate coverags is 

only available for one of these# uhiidi is «n area in the Central 
Talley of California# and inoludes pert of the Sisrra Hevada and 
the Coastal Range* The eeeond test wea is in Eastern Boglend. 


Hie 70tnm bulk negatives for each of the sceiies including 
part of the test area have be^ examined, and CCi^s obtained 
for several of the scenes. The centre points, seine identifiers 

» V 

and lo^l tape code names for these tapes are listed in the 
table below. 


TABLE I. 


Local code NASA scene 
identifier 

Centre point 

Date 

Area 

Tape B 

1038 - 18114 

37 27 N, 120 22 W 

30 Aug 73 

C. valley, 
Calif. 

Tape C 

1031 - 10334 

S3 N, 0 W 

23 Aug 72 

E. England, 
The Wash 

tape D 

1056 - 18114 

37 23 N, 120 22 W 

17 Sep 72 

Calif. 

C. valley 

Tape E 

1308 1 18122 

37 34 N, 120 37 W 

27 May 73 

Calif. C. 
valley 

tape F 

1228 - 10293 

51 40 N, 0 06 W 

8 Mar 73 

S.E. England 
London area 

tape 6 

1308 - 18120 

38 58 N, 120 08 W 

27 May 73 

Calif. Lake 
Tahoe area 

Tape H 

1307 - 18071 

36 08 N, 119 30 W 

26 May 73 

If 

Calif. S. C. 
valley 

(B^rsfield) 

Tape 1 

1307 - 18064 

37 34 N, 119 06 W 

26 my 73 

Calif. Lake 
Mono & Sierra 
Nevada 






2* VTA Location and Compllittlon 
2.1 Introduettm 

Of primary importanee for automatio generaliaatioB of oatelliie 
lma«ory, ia tho abiilty (i) to align the digital data to geographical 
ooordixkates» and (ii) to aeleot thoao pizela vhich lie vithln a 
apeoif lad target area* 

fhare are aoeeral mya in whioh this aii^iimt oould be made, 
but for an automatio aystan operating on a global aoale, Ihe moat 
hopeful in tema of computing time ia to i»e the i^ormattoh Bailable 
on Ihe aatellite position and attitude, ms is the method used ty 
VASA for meteorological satellites and for ERd?S bu3^ (system corrected} 
imagery. From predLoted or measured wbital parameters ^ ®TS 
processing fa<m.lity calculates the latitude and lon|^tude of bolh 
Ihe sub-satellite point and the central point of the image. Tl» 
latter correeponds to the intersection of tile principal of the 
RB7 cameras irtth the earth's awfaCe and depends cn the attitude of 
the Satellite. !^eee tiio points are then used to superlmpcse a 
latitude-longitude grid on the image. Tha pre«laimch estimatea of the 
accuracy of this grid were about 500 a, lAieh implies that a 50km square 
Ma could be located i^th a IjS OxfOr. this is sufficiently accurate 
for the Sort of changes that could be detected by. this method. 

2.2 techniques for eemtllln g^e 

t^As are defined by the geographical coordinates of thejr Comer 
points, and compiled fTm the GCTs In tuo stages: 

i) the conTOrSion from latiti^e - longitude ccorddnates tO scan 
line, pictwe element coordinatea (tape coordinates) is performed by 
linear interpolation betTOen the tape coordanates comespondittg tO the 
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REPRODUCroEillY OP THE 
oBiaiN^ PAGi ig pggg '' 

^temection of tte latltiido«lozi^tudo ^1h tho edge of Ihe frano 
ttaliig the grid l^o^tion eontained in the ahnotamm teoord at ^ 
atari; of idia tape. ' 

ix) irhO distal data cotreepohdilhg to a tJSA ie eompiied hy 
seiLecting froa the CCf s ^oae polnte idiidi fall hetwean linee oonneoting 
Ihe corner pointa of the UTA. |he tIfA data la then stored aa a 
.a^arate data file oonaietlhg of eegBonta Of scan line of varying lengtii. 

2.3 Regi^ts 

!|he aecuraojr of coordinate confersiOns perfoxned hy this ae%Od have 
heen teated nsing easily' identified control ppMts for the Californian 
inages (f apes B, D, and e) * The gBogFaphiCal cocNrd^tes tere taken 
off the 1 t 250 1 poo oapSf and ci^^rted to tape eoor^natea • A 
sectibn of the Image stproiinding the tape coordinate vas displayed 
using line printer grc^-^e^e plots, gave an indication ol the 

positional errors. 

VigurO 1 show Iho results Obtained fbr the dam of Honey ieke 
in G^lforx^. ^e positive found for the dam On tapes |3 and B3 are 
aarked hy open eirclOs, and shov errors in the rangs of $ to 10 M. 

Tests for other parts of these images show that the e^r has a 
largo oonstant component, and a smaller component i^ch varies across 
the image and has a magnitude of about 1 Iqs. 

2.4 MsGusslon 

The spiu'ee of the ma^or error is almost entirely in the latltttde<- 
longitude grid data proi^ded by MSA, and similar errors are fotmd 
from the grid siq^erimposed on the photographic products. It is 


possible l^t 1^ i^Uaer vttat IS due io ISiS eoord^fiate eontirSloSi 
SlgoritM* ^19!^ is pearidcttlsr3y litoly near the eornerS of tiis 
tnafi^S where ^d m^ks are sparse, and linear interpolation iS 
less reliable beoanse of gec^tric dietortion. 

These eisors in the posiMoning of the digital data relatire 
to tile tISAS are too large to allov reliable oonitozing f<»* dianges 
in the inagexy. 

4 

Two solutL^ Bxe possible for the purposes of tMs invesld.gatlon, 
Sither the tJfAs can be defied ptrely in terms of tape cooruhlnates 
md subsequent images aligned ly use of a correction T;hi6h initially 
would have to be determined menually, or the geographicai 
coordinates could be Converted to tape coordioates t^th due ^iovanee 
made for the exror in the grid. The advantage of the fhpst is that it 
removes tiw need for mpaCking the annotatiQn recwd, and ep reduces 
the core store required for prp^ams, whitih would brins useful savings 
in oomputer usage. The second alternative allo^ gep^aphically 
Significant (or at least interesting) areas to be Used mpre easily, 
but at the e^ense of incfeased computer Usage* 

an interesting eicerclse in t^e context is tbe use of a 
erCSs*^pi^e|ation tec^dqua to align one image witii another. TMs is 
a standard te<^tique in crystallOi^aphie and in biologLcal image 
processing uhere the image consists of sell defined objects seen a^inst 
a unicorn background, but it is not immediately obvious that it sill bs 
Successful in alining remotely sensed images ^re there is no clear 
^stinction between object and baekgrotmd. penaity slicing Is one 
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means of reducing images to a Col lection of objects on a uniform 
background, and this offers the most promising line of action. 

This approach is currently being pursued. 

Alternatively, the tape coordinates of features recognisable on the 
photographs COuld be taken directly from a coordinate grid superlm^ 
posed on the photograpA, provided that the tape and photographic 
imagery are coincident. However r it appears that there is an 
aiong~traCk displacement of tape imagery relative to the photographic 
imagery by as much as 8 km, and so there Is no advantage is using 
this method. The relative displacement of the ^o types of 
Imagery has been found for all caseS fOr Which we have CCTs. 

2 . 5 Display Techniaues 

In order to locate recognisable ground features in terms of tape 
coordinates, some form of display of the data on the tapes is 
needed- Two forms of picture display are available which are to 
some extent eompleraentary, and both will be maintained. 

1) Line printer gray scale pictures simulate the photographic 
imagery, and thereby provide a rapid display Of the data. However, 
they Suffer from the disadvantages of mapping at too small a scale, 
and of a limited range of gray tones ^ For speed, no allowance is 
made for the skew on the imagery, or for the distortions imposed 
by the use of a standard line printer, although In principle this 
could be done. Therefore, the resulting map is not geometrically 
accurate, and is not conparable, in the overlay sense, with , 

eonventional maps . 

ii) Contour maps of the gray scales drawn by the graph plotter 
have Corrections for the Skew and sampling effects, and have an 
easily variable scale. In addition there is a wider range of 
gray scale Information available. The disadvantages are the time 
taken to produce a single display, the difficulty in contouring 
areas Of rapidly Changing gray tone and the lack <Of an inmiediate 
interpretation of the display. 

Figure 1. provides an example of a gray scale printout and Figure 2, 
shows a sample contour map. 
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3. Gray SealjS^SMipling REPBOPUCpS^n^^r^vS^ 

OWOINAL PAGB IS POOR 

3 . X XnModugfelQB 

Radiometric errors in the GGT imagery impose iimitatlons on 
the use of the gray scale histograms. It is hecessary to 
investigate the magnitude Of these errors, so that the 
significance Of partlopiat differences hetwoen histograms 

can be assessed. 



The sources of radiometrio error to be investigated are:^ 

1) those related to the conversion from radiance to 
gray scale levels on the ggTs 

ii) those eaused by the uneven response of the sensors. 

3.2 The MSS Transfer Function 

A detailed description of the MSS transfer function, which 
determines the conversion from radiance to gray^scaie values 
is given in the section "System terformance" in the ERfs 
pata users' Handbook, a brief smnmary will be given here. 

The radiance » Rf to tape count (gray scale values on the GCf ) , 

T, transfer* function is always linear, but as will be seen, 
the a priori probability of any pixel having a tape count T is 
a function Of T itself: that is> the probability distribution 
(as opposed to the freguency distribution) , is a non-^iiniform 
function of f . The cause Of this noh^uniformity is the discrete 
nature of the gray scales, and becomes more apparent when 
intermediate transfer functions , f between R and the sensor 
count# s# and f 2 between s and t, are considered. 

For band 7 alone# the transfer function f iS linear and s has 
a range of o to 63. The Second transfer function § 2 # which is 
related to the proGesSing of the raw sensor data by the Special 
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Processing Subsystem (SPS) , Is a|s© linear and f has the same 
range as S. Miong the operations performed by SPS is 
radiometric Galibration ©£ the sensor data and this could 
cause fg to differ from a One-^tO-one mapping of S onto T. 
if, for excunpie, two different sensor Counts, Sj^ and S2» 
are always mapped into a single value, f^, then the 
a priori probability of obtaining will be twice the 
mean probability* 

The non-uniformity introduced by this effect will be spread 
over adjacent gray levels only and could be removed by 
smoothing with a simple rianning mean technique. In practice 
the Galibration function changes Gontinually# and this 
smoothing does not appear to be necessary. This is shown 
by Fig<>3 which is the unsmeothed band 7 histogram for a 
SO km square area in th® Gentrai valley in Galifornia. 

The sensors for bands 4, 5 and 6 are usually operated in 
the Gompressed data acguisition mode, in which the sensor 
transfer functioh f^ is non-linear, with the sensor Gount 
scale compressed for high radianGe values. The range of 
S is again 0^s4igB2. To GorreGt for the non-^ linearity 
of i^f the SPS transfer function fg is also non-linear 
and can be approximated by a guadratic of •tdie form 

T 0.025 + 0.45 S (D 

This gives a tape count scale which is linear with a range 
O^T 4i27^ or twice the range ©f S. Because of this 
difference in the ranges of s and T , the transfer f unGtion 
cannot be a one-to-one mapping. Without the smearing 
introduced by the Ghanging Galtbration ■function which 
modifies equation (I) , the proba,bility Of obtaining at 
least half of the f values would be zero. Because of the 


non-linearity low T values will correspond to two or more 
S Values f whereas at the hifh end of the radiance scale 
only one in every two of three T values would have a non 
zero probability* 'fhsi is illustrated by Figure ^ which 
shows the mapping of the 64 S values into f on the 
assximption that £2 represented exaGtiy by (1) . This 
distribution shows that the a priori probability of 
obtaining T » 0 is approximately six times greater/ en 
average, than that of obtaining any particular value 
of T greater than 120. 

3*3 The— Effeets Of the non-unifOrm probability on the 



The gray scale histograms derived from the imagery are 
the product Of this nOn-uniform probability distribution 
and the actual gray scale freguen^ distribution. The 
effect on the histograms is illustrated by Figure 5* 

Here each line Shows the histogram for 12 scan lines 
of the band 5 for Tape B3. The tape counts have been 
combined in pairs to give a range of 64 values. One of 
the most prominent features of the histograms are the 
columns with anomolously low fregueneies whieh persist 
over severai sets of 12 scan lines, and which are 
associated with gray levels for which the a priori 
probability is low or zero* Much of this irreguiarity 
is due to the stretChlhg of the original 64 ^scale gray 
levels to the 128 gray levels on the tapes as described 
by Thomas (1973). . - — ^ 

• 

BeGause the exact form of the mapping f^ from S to 
T chaynges from sc^ line, and from date to date, 
the lower limit of the detai-l ih the histograms 
which can be accepted aS real Is approximately 
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OSielHAI. PASS IS 


ih» reu^ of f nMdi eorzospoDds to a single value of S* Beeause 
of ibe BoU'BlineEUC'.ily of tbis is a flmcid.an of T* i*e* sutler 
ohmgos can be detected in me radiance of dark features than can be 
detected m brigiiter features* 

The irregularlly of me histograas can be reduced* end at me 
sane tine me redimdanoy in the grSy scale values due to the 
expansion of me range bj f^* can be removed Smooming me 
histograms vim a suitable oonVolution fimction* The aiii of this 
conTplutipn is to derive a 64 level scale for each level 
an equal ^ nclori probability* The probability of obtaining a 
value betT^een T end T 4 - 41 T is related to the probabiiiiiy of obtaining 
a value betveen S and S *f by 



For a unifprB distribution in s* this reduces to 

f' 


yields 


= 0-0? S* Ok? 


mich* on Substituting for S gives 

IJ 




( 4 ) 


The i^dth of me 64 int^valS in T vhieh have equal a 
probabilities is found friaB ( 3 ) and ( 4 ) 

- - o-or/m A.0T * 


The centre points of these intervals is given by 


% == 0 
^ ^ Sr 

u - a. 


Mi. 


v~V 


fv*1 


(6) 




i^ldminarjr tests tove been Bade nsing a trapesoidal 
oonvoiutien fimctton iddieh has a wldiAiet haU height f^vea by (S)( 
outer edges vi& elopes of +1 and » It and an equal wd^tlng for 
all ppints near the centM of the intOTval (SLg.6) 

Qhis conTolution fuhetlm has the advantages that it Is smoothly 
vnying over ^ range of Tf and ^iows for tbs ohs<«to^tiro or one- 
to^tiffee nailing at hl^ T values by vei^ting equally all values 
near ‘U)s centre of the idadov. ^gors 7 shows a hlstogrsUB both 
unsBOOthed and SBOothed using this convolution function, fhe full 128 
f values have been used in plotting the histo^ranit although at iost 
64 of then are independent t and a 64 point non-linear scale watdd 
be more appd]^ate. 

3*4 identlficatl<m-6f -Seanner Irfcee^ 

Xhe aulti*^peetral seaimer that produces the radioBetrio data has 

6 independent sensors in each bandt and thus aa^ unevsimess in tiie 

response of those sensors idll produce a banding effect on the iBagS 

which would repeat eve^ 6 lines* In m effort to identify the 

Bagnltude of this effectt data from the CCTs were subieoted to the 

te^ique of power spectrim i^lysis* 

Xhis technique of analysis tahee a given sequel^ of data and 

reduces it to a series of wave f orra* SadS wave, Whi(^ is of the fcrm 

of a sine wave, is characterised two parameters, a wavelength (the 

hLstance between two successive poaics), and an amplitude (the height 

* 

of the wave fon}* The power associated lAth eadi wavelength IS a 
Beaeufe Of the contribution of that wavelength to the total signal* 
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VfauB anj walreloagth oorrespoodi^ to a pronoiaaood baadl&i; affect idJl 
jOot «a a peak on tlie ^aph of power against wawelangth, wMoh Wiil 
thus idontj^ strong periodicities present in tdie data. 

The analysis was Inpleaented hy use of the program published 
by fiavis (1973) . Bie data input was a siring of gray-scale saluee 
at ri^t angies to the scan iinesj pwallel to the flight path of the 
satellite^ with a sampling tuterrsO. of one scan line. 0!hus a^ 
periodicities uith a wawelength of 6 tines “Uie SampliJig internal 
could be associated with unevecmess in the sensors. 

The resulting power speotra for all foinr speotral bands wero 
oaleulatsd for several isagea over lengths of 300 snn lines. The 
resulte for an area of Ihe Norlh Sea (Tape G4) and for an area Of Iho 
Sierra Nevada of Galifpmia (Tape P3) are shown af KgUree 8 and 9. 
fieeults. not shown hersy were obtained for adjacent scan positions . 

demonstrated that the speotra were identical or very ei nll a r 
if a horizontal SMft wae applied. The re^tant plots have 

been SipothBd by applicaMon of the Hanning tria n gul a r window iPaMa 
1975) y and pov^ is plotted on a log^ithnio scale. 

The resists for the Wish image (Figure 8)y shows ti^t th«?e is a 
distinct peafe n 4»^ the wavelength of 6 sctn lines y identtfj^ng the 
unevenness of Ihe sensor e. On this inage all the ^ueS are very low, 
reflecthig the fact that the image ecnsists of open sea wMch is 
n^f ormly dark. Because of the low input values the resultant 

speotra are very noisy. However y the peak at wavelength 6 is markedly larger 
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than the other peake» wbihh are aeeoeiated with holee* tHie hei^t 
of this peak deereaees as the mean radiance linereasest iBidsTlns that 
the nagnltode of the enevraneas is siollar oh all hands, but that its 
relative inportance vuries inversely with radianoe* Kraph 
suggests that the vaii^abillty atlributable to this etfeet Is of the 
order pt ^ to 1 gray Soale v&due. 

The results iot Ha Siam Nevada (Jlgttre 9) Show the pr.ver speoira 
In the preaenoe of a much slrcmgsr signal* The parallelism of the spectra 
for the few hands ehovs that, for this imge, most of tiie power is 
related to sorface featurSs, Once agi^ a peak is visible at a 
vavelength of 6, and auggests an irregularity of about 1 gray scale 
value, that can he attributed to the aecuuaer* 

3.9 Kseussion 

It has heen ehovn that both the hon^unifornity of the radiance 
to iray scale transfer fraction, and the use of 6 independont aomsors 
for each apectral hand, cause detectable ei^eete In the imagery* 

The ixregulerlty of the hietograms is aoeounted for by the non- 
uniform distribution of m Priori probabilities, of grey scale values* 

Tl^ is most Sigaifieahi at bi^ radioaoe values, iher* it is s^uitMli|ent 
to an tmoertainty of about 3 In the gray Seale. 

The banding introduced by the unetjual responses of the 6 eensOTa 
casses an uncertainty of about 1 in the ^ay scale* This is most 
Si^ificent at low radiance valuea where the errors introduced by 
the transfer function are small. 
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ifasse tw •ffiBottt Sapote ft lover Unit of 2 geay scftlo vftlvoe in 
tho vldtii of the elBoaee vhioh can reliably be wed in preparing and 
CTy % p > y 4ttg gray eoale iiBtributio& biotograne* An ftlgerlttati ban 
been enggeated lihiob divldea the full range of V into 64 olaseee wllii 
•^ual a nrlorl probabillly, but tbe inportanoe of aenacr unevenneaa at 
lev radlaneo lovela inplies that 32 olaaaes vould be aore auttable* 


4. kfuAvAim af glatoapanfl 

4*i ihtrodttotlon 

Xoitlid «xand]iatLca of histogrens for sovoral avoao olumed 
that tSMoro VBS a poaalblliigr of equating oomponmta of tiie histograB 
vitii ourfaoo Igrpoo* Vhxis tbo possibility of splitting histogreBSi 
and ttaoroliy deriving ostiBates of tbs pox sitags ^ any inags eovarod 
by oaOh surface type was investigated* 

Che ihivestigatioa ooncsn^ated initially on an area uhioh sbeirsd 
a eiii^ls binodal histograqa* oould bo readily intezpretsd. 

This area vas peoft of iaage C4t which covered part of eastern Ex^ Han d 
and the adjacent aoa area* The land in this area consists mainly of 
sninll fields of tite same order of nagi^tudey or smaller* than the 
70 B resoltttlan of scanner* and thus the overall iaq^ression given 
by the ie of a unlfom gray speOkle* It vae thus not ezpeeted that 
titis technlq^ would bo able to dia'^gniSh between diffoi^t areas on 
the land* in contrast* the sea on the image shows up as a dark area* 
visible on all few bands* althou^ the greatest e<ntrast is o^ered 
by hand 7* in ihich the gray Seale values for the sea are nearly all 
either 0, 1* or 2* However* on hands 4 and $ the eonSiderable areas of 
nittd~flat* turbid wa^r and sand hank ^e readily ^alhle. it was 
hoped ^at for these bands it sd^t be possible to split the histogram 

into three components * thereby identifying j^oportlons not only of 

/ 

land sea but also of turbid water* However* the Mstogrms 
were in fact monomodal shoving that the two distributions that could 
be identified on bands 6 and 7 vere overlapping* and that it vaS Ju>t 
possible to accurately identify even two components of the histogram. 


4.2 


% 

A Itf gi ntBib«r of todmiquoa haTO Iwett dortlopod for oplittittg as 
obaorrod foequoa^ distribuHra isto tho sum of t«> or sore oo&ponost 
diotribtttiaiia. ClarlE (pori.eoom*} has rortoiiod ihoao aotbods^ asd 
voeognlaed tbroe sain grotqpai graphical » analytical, and numerical* 
Graphic^ tedmiquos roly on the subjoctL-vo into^retatLon 
of diaeonidnuitiea in the eunulatiro frequency eurre, and are thus 
Bubjeot to a hl^ degree of operator error. Audytical tedmiquoa 
are ezceptlohally difficult to calculate, and do not on ete^ 
oeeaaion yield a solution* Numericai teehniques are mu^ more 
taried, but nearly a^l use iterative methods to improve upon initial 
eatimatea of parameters describing the component populatLona. l!hi^ 
thus suitable for inclusion in an automated data handling system. 
Of the seraral sltemativeB available, it vas decided to adopt 
that of Jones and Jamas (1972), Uhieh uses a maxiaum likelihood 
method of eatimaticm. Ths mc^ reason for this decision vers as 
follovs: 

a) Althou^ initial eatiboates of the parameters to be estimated 
axe required, the method will eonverge to a stable solution even 
if tiieae are oonsidwab'Jy in error, and it uould thus alnays ho 
possible to provide a complete null set of isdiial eatiiaates . 

b) Ihe published writ included a FCRTRidT program, and So 

made it possible to examine the capabilities of this approach without 
the investment of a large programming effort at this exploratory stage 

REPRODIJCIBILfnr OF THS 
ORIGINAL PAGE IS POOH 
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c) Xt is possible to extend tii« ei3.stiaff proi^aa to inolnde l^^s 
ft'?;. eoBpcnieat distribution oilier than the Normal, The published lork 
incudes botii the elreular Nornal ^tributlon, and the neOhanisni 
for addine other distributions. One major dlsadeanta^ is 
that at present the existi^ program will handle only two components. 

4.3. teiflf_Outliae of^the MfrAH hood Method 

The maximum likelihood method relies on the fact that for ai^ 
observation set, X, and a statistical model wtlh paraoslar set ©, 
it is possible to caleulate the a nesteriorl probability of X occsurrfeg 
as the outcoioe of the stochastic process defined by tte model. 

Thus for any model it is possible to find, in principle, a parameter 
set e that will maximise me ^ J^eteriori probability, or likelihood, 
L, of the observation set X. 

In the case of a set of parameters refering to a mixtif e of 
distributions , Hie model becomes s 

Let the j*- underlying probability distribution be fj (z^), 
idiere j = 1 ,.,.,,, m, and Hie subscript i, i . 1 ,.,.,., n refers to 
the observatione. Thus for each component distribution, the 
probability of the observation set is: 

if the proportion of each componont distribution iS Q^, then the 
total a probability of the obaervalion set is #ven by 

' LCx) .|o, f 
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tsualiy* bowewr it 4® •«!«? to traasfom fheao firobabtlitles Into 
tho4n lofl^tthgpf end ri^i&co 12i® Buitlplioatioi& Iqr in adtttlonr ®o tint 


tin tiransforned oj^esaion to be nazialsed beeoaee 

A M ^ Qj ,1 


Tto ^ameier set 0 for the model consists of (m»l} valoeS Q^i 
fws i^e parameters for eaOh of tiie Sub populatlODS f^« Vets that 
because the Qj 'S must s«m to uidtyt (n ^ 1) values fully deteiadne set. 

In the case to be considered herOf the imderlying ^tributLonS 
are ail considmnd to be Normalt end thus hiive the form 
f£i^ = exf 


where ^ and d^ are the mean and standard deidation* Idnre there 
are two such disiribtttiOttSi onl^ a sin^^e propbrldniaiily parameter 


ie requiredt which we will call 0(*^ 

^us the total statistical model to be considered is 





2.r 


And the paraneter set thus oonsiSts of the five parametera 

f ' 1*1. ®« 3 

Because the derivatives ^e non^lilnear* it 4$ not possible to reach 
the mazimua livelihood Solution by direct evaluationr bat it is neeeeeary 
to Use a numiriCal teChDi^ua* The Jones & jameB ^co^am uses the 
S'^epeSt ascent method to provide a rapid edayerganee tow^ds the maximvonr 
and thereafter disness to the irewton Raphson method vhich is superior 
near tint ai more B«Bsltive to poor iritial estimates. 

. Rl?RODUC^lLITy OF the 
ORIGINAL FAGS ^ POOR 
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ia orter to be able to i»e tiie aazintB& liikbiihbpd nelSiod 

c 

it la &eee 88 ar 7 to have an observatloh Bet» raihen than a grou|ied 
Matojsrani* Hovefery tte hiatoj^aas of anjr one anea are the result 
of the exanlmtlott of a large number of pUxelSf and it vas tlno 
eoBSldsred oonventent to ooavwt ihe hiato^am to a xedueed obaervatton 
aetr pise^ as a tccem ot data reduoti(ni« Becai^e of Ihe integer 
natire of Ihe ^sTwecale values » it was poisible to reeons trust an 
Ohsmh^tlpu set wlih appgorinatejy me same proporMon of valuaa at 
each gray scale level as in Ihe ori^^nal data set* ^e program vas 
set to accept a data set of up to 2GO pointSt vas opnstrueted 
from histograms typically derived from 80,000 points. Repeated ^lysis 
using a larger data set of 400 points yielded almost identieal eomponent 
distributions f conflining that &ls ^ta reduction does not distort 
ms results* 

efficimoy of the program vaS ^h^Oved by the use of integ^# 
rather man real, arrc^ for Storing me data set* Althou^ miS 
required the use of me li^Af operation for calculations of the 
probability fvinction, it peliits a more efficient storing of the data. 

A histo^am having routine vas provided, vhieh used the line printer 
to ^ve a plot of me input Msto^am, me histogram derived from me 
initial estimates, and me histogram fitted by me program* 
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4«$ Reettlto 

Results vsM obtained f!roB the jro^aa for MatOgruM derived 
fr<m all 4 spectral bands, fhe data from band 7 yielded ibe clearest 
results, S^ee the bimodallty Of that hlsto^si was eitreme, vitb 
virtually no overlap between the two oonqMnents* iMtlal estdoates 
of the parameters were ebpam visually, and a total Of 30 iteraldtons 
were reqidred to produce stable estimates« A s anp le oulput is 
dnelided as figwe 10, and a stffi^y of the resvO^ts for band 6 ^d 
7 is C^ven beloirt 


Reanlts of Mariihiim L^iteeljihood E 8tiinat ion_bf 
at MiredJT.Qinal^Ms^la^bntimB 



As can be seen from the table, for band 7 population 1, whidi was 
identified as the sea component t covers 71»7^ of 'tte ^ea« ©10 coiwesfonding 
estii^te for band 6 however, was hishe?j at 76*9^, 

The results for bands 4 and 3 were cons-iderably less eneovraging* 

The likelihood response Surface poved to be very flat, and pkUS no 
stable solution seemed possible. in partioul^, the estimates of Ibe 
standard dedications seemed most imstable. 
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I^’ivtba^re tto most stable parameter, tbe propprtipn of l&e 
data in populatiott 1 eowrerged tovwds a value cheater than one. In 
a different context su^ a result, and the earresponding negative 
proportion for population 2 oan be interpeted as representing 
a sin£^e parent popidation from idiieh a ^stinctive Sub?population 
has been removed. Bowevef , in this case the eoncept of a negative 
(sttb^active) populatien has no interpretable meaning, and the 
rest^ts are thus of little v^ue. 

4.6 jR4.a9Mai.qa 

in^s pilot study has shown that the automated splitting of 
^ay^seale Mstogrems is possible, and can yield useful results. 

It has also shown that the results yield ^crepwoies between bands which 
veTe sufficient to Cast dpubt upon the applieabiltty of the method 
in general. In the li^t of these consideraticns , and the considerable 
compatatiGnal tme required to perform the analysis, it was felt that 
this approach, while possibly useful in Speeillc instj^Ces, vaS not 
suitable f^ routine application to UTS data. 

!Phe alternative histo^^ splitting techniques that are available 
might produee resuits more rapi^y, ^d be more readily applied. |n 
p^ticulsr metbpds that can handle more than two components need to be 
examined. EOWevnr, at this exploratory Stage it is neoeSSary to msira 
eonsiderable cheeks on the input histograms, tp corr^ these i^th 
gToimd truth* 

At the time of this part of Uie Study, the propiem of correj-ating 
tap® Goordiaates with geographiGal areas, had not been Boived. Thus 
the identification of the spectral responses of land types neGeSsary 
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for the interpretatioh of the hieto^ai&a could hot be pursued* Shis 
teehnique ima hot therefore tested fcrl^er mtil gore baekgrotoid 
ihfOria'&oh beeage athilable* 

It is hpveter pertiheht to eohsider the question tdiether other 
ana^ses hi^t not j^eid sig-ilar results as cheaply* Ih pcg*ticu|sr 
if it were posEible to gap ^Oimd types » it would be a sigple oatter 
to estimate thedr area* She poSsibilil^ of using either oluster analysis 
to ^oup podnts (or coilectiohS Of points) into hopefully reco^zable 
^oupS( or some fopi of diSipiiiihaht function to gap points onto 
& groups might be equally useful* Scsae of theee teohhiques 

we discussed belov. 
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5. THE USE QF GLHSTERXNG TECHNIQUES 

« 

As has been eii^hasised in earlier reports, clustering 
techniques have been used in an attest to Mentii:^ regionai 
subdivisions at a suitably generalized level within an EM^ 
scenes ihis approach has produced interesting results, and 
as they have been simfliiarised in the papers included in the 
Appendix to this Report, they are not dt®licated here. The 
three main papers concerned are: 

Gbjeetive generalization using IRTS unages. (to be published 

in: MyirQnmental remote sensing praetiges £u^ probleis , 
Edward Arnold, 19TS or 1976. 

MethodOlogiCai questions m the digitized Malysis of MTS data. 
Journal British interplanetary Society . 197S 
The relative perfoiSnaace of seme unsupervised clustering tecMiques 
for the per^ field ciassification of LANDSAT data. 

Read to the Eurdue Sp^siiih, June 1975, and to he 
submitted to Pattern Recognition . 





6 « Edge -Pe te eMon 

c 

6.1 Introduction 

Of the several techniques availahle for the eoinputer proeessing 
of pictures, one of the most fandamental approachea is to split the 
picture into noh’^erlapping zoneu bjr ihe debOctLon of edges betHoen 
approximately uniform areas (kosenfeld, 1969). ^re have thus he^ a 
considerable number of techuiques developed for detecting edges in 
piotiores. 

It was decided to apply some of these techniques to the 
ERfS Imagery, in an attempt to identify the kinds of featvU?e that 
could be distinguished by their sharp edges. In particular it was 
felt that this technique wOt^d be i^Seful for identifying areas 
on the tape where readily identifiable featwes. Such as l^es, 
are lacMng. tedinique would Idius be potentlslly Very useful 
in the i^igated zone of %e Central Valley of Calif arnia, where 
large spatial units occur in a regular reetanguiar pattern. 

6.2 iechniouea 

Commonly, edge detection procedures consist of two steps, the 
f^st an ini tied detection of potential edges, the second a process 
desired to 'clean iq)' the resultant image. Such a scheme will be 
adopted here, though at present only the first step has been implemented 
The standard technique in finding edges is to locate the places idiere 
there is an abrupt change in the picture function (i.e. in ttie gray 
scale values). This is performed "by ex amin ing the local slope of iiie 
picture function, and idiere this slope exceeds a given threshold value, 
an * edge* is detected. Various more sophisticated filters have been 
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proposed (eg* by Rosenfelid 1970 }» idilch exanine the dlffereiioee 
between average values either side of the potential edge, or which apply 
iatehed filters to ea<^ point in tmn* However, for the purposes of 
this investigation it was felt initlaliy that a fairly exude definition 
of an edge would suffice, and that any cleaning up of ^e picture 
^lat this would require could be carded out subeequently* 

^e <qtteratipnai procedure adopted was to exastne each point in ttam, 
and where the gradient at this point exceeded a ^eshold value, to 
demark that point as belonging to an edge. Thus an Initial progran was 


written which calculated the slope at each point, using the 


function: 



n*.. t \ a-x 


i-tu 



fes s,* 

L* 


Vhere ft;* ia the gray scale value at point ij. This v^ue 
f is thus the i^Muia difference between each point and its 
ei^t adjacent nelghboisrS. 


6.3 Besuita 

A ;^O^am was written to calculate these Valuss of , 
and produGe a bMary image of edge points* ^is has been applied 
to a portion of the central valley of California (Tape D3). Pig 14 
shows a gray scale print out of a portion of the analysed area, and 
Fig.15 the edges detected in that area. — 

As cen he Seen, this oparation results in edges which oLoSely 
correspemd uitb the field pattern as revealed in the gray scale printout 
hut has the unfertunate property tliat all edges are douhle, because the 
prograirne has identified each edge twice. This effect can be explained 




by fefeX'ence bo the one dimensional ease* If we scan along a 
line, and tind a large difference between points i and i + 
point i will be labelled an edge. However, this same difference 
will also be located when examining point i + 1, so that both 
points i and i +1 are labelled as belonging to an edge. 

TO eircumvent this problem, differenGes were ealculated only 
for one guadrant of directions, so that only three adjacent 
points Were examined * Thus the edge identifying fundtion 
becomes _ ’■ 

« Mi* [ CWvM - 0 ; 0^-^ i • " ' ) 

Two thresholds were used for the plotting Of Pij . Where Dij was 
greater than 6, a dot was plotted, and where greater than 12 an 
asterisk was plotted. A result of this program, for the same 
area as for the other two piotures, is given in fig. 16. Note 
that a much cleaner image results, and that very few edges are 
actual ly losti 

6 . 4 Discussion 

The program to perform the edge detection has been used as a 
standard display technique - 1^ has been found advantageous to 

use both a threSholded, compressed, map of edges, and a Straight 
pseudo~gray scale map of edge intensity , as an aid to displaying 
the data . 

Experience shows that the value Of e threshold, which filters out 
much of the noise, cannot be Set a p^or 1 but must be adjusted 
to the needs of a particular image. Work with the image Of the 
Wash area, using band S, suggests that a threshold of Dij ^ 3 
Showi Up many of the boundaries in the water area, but labels 
virtually the entire land area as "edge”. An alternative, 
higher value of Dij ^6 results in a totally "edge-^leSS" area, 
but a more satisfactory picture on the land . 

The routine has also been used to detect edges of lakes in the 
Sierra Neyada portion Of the California image, whero it also 
detects Stream lines. However, it is not satisfactory for 
detecting the ridge Lines that we know are present, larfely 
because the features change less abruptly than the Small 2x2 
neighbourhood of cells considered here . 
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A further development has been to apply a further thresholding 
technlgue to the threshoided image » in an attempt to produGe a 
"Gleaner” picture. If we set the value at cell Mij equal t© 1 
if Dij^f where f is same theshold, and equal to 6 if 


, we produGe a 
We then " Glean"* this map by 
adjacent to each cell, i.e. 


threshoided map of the gradients, 
examining the sum of all Cells 

^ +i 

k«-i r*-! ^ 


if the sum is equal to a given threshold, fs, we can assume the 
point ij is part ©f a Gontinu®hS edge, but if it is lower, we 
assume it is a pseudo-random effect, due t© features in the image 


smaller than our field of interest. 


Work by Yamada and Farango (1965) suggests a value Of _Ts = 4 . 
would be sufficient to identify linear edges, but we found this 
to be too severe, and that a value Of Ts = 3 iS better for 


However# these eleaned maps often lost mueh useful information, 
particularly where the image does not oontain strong, clear lines 
as was the ease with both the British imagery, and the Californian 
imagery away from the Gentrsi valley. In eontrast, in the central 
valley, where the main feature is that of sharp edges between 
fields » this "cleaning up" operation was found to be most valuable. 
Even here, however, it was not possible to generate Closed boundaries 
round uniform areas, as was the ©riginal hope. Nevertheless, these * 
edge detection routines have proved exceedingly useful teCliniques 
for displaying the imagery in a way that appears far less cluttered 
than the pseudO'»gray scale technique. 
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7. COMPUTER PROGRAMS 


this section does not attempt to give a full listing of the 
programs developed for this research: these will be listed and 
documented in the Final Report to NERr at the end of December, 1975. 
However, it is thought that this detail will enable the range and 
scope of the software to be judged. 

the section is divided into three parts: first a record of the 
various sub-routines which are used extensively in the programs 
which are described in abstract in the second and third sections. 

L_l_ Subi^ routines 

1. I'NREAD:- reads and unpacks an annotation record in the ERTS 
CCT format. 

2. ANNRD;« reads and unpacks an annotation record including 
the latitude^logitude grid data. 

3. HEADS: ^ produces a line priinter listing of the relevant 
information in identification and annotation records. 

4. BULKI:^ Gonstruets the mask for compiling square UTA's with 
sides parallel to latitude and longitude grid. 

5. BULK2:-p compiles a UliA using the masks generated by either 
BULK! or BULKS. 

6. BULKSi-^ constructs the mask for compiling polygonal UTA‘s with 
a maximum of eight sides. 

7. BULK3;- a self-contained program which will compile histograms 
(smoothed and unsmoothed). 

8. BULK4:^ will contour on the graph plotter rectangular aroas of 
the images defined by tape coordinates. 

9. Other routines include programs to unpack and pack the digital 
data into J words as used on the Original eGT's, routines for 
converting the GCT's to IGL readable format, and for producing 
grey scare pictures on the line printer. 

Gomp^er programs availabie at Cambridoe, UK. for the IBM 
system. 

fhese are referred to by DSN’s on the Archive tape (KMG 201). 


j 
1 
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KMC2.CLUS.VELDMAN LABEL = 1 I 

■ - ■ - ,i 

This program uses the eliustering method described by Veldman (1967) | 

to perform art hierachisal cluster analysis of data from the ERTS I 

tabes. The prgram can handle only 176 data points with up to 128 
variabTes per observation. These points are assumed to lie in I 

a grid li6 across by H down, representing the 200 x 200 'fields' I 

derived from the ERTS CGTs. The program produces a map of the v 

resul tant classification for the line printer for the last 10 stages 
of the clustering. I 

,P 

KMC2. CLUS726.ITER (■ LABEL » 2 I 

Program uses the Swain iterative clustering algorithm for 726 data 
points, 1.e, for 100x100 fields, and produces a map of the results. I 

The iterative algorithm prints out the transition roatri x for each 1 

iterative step, and a final interdistance matrix for the stable cluster ? 

( 

means. 

I 

KMC2.CLUS2880.ITER LABEL =3 5 

Uses the same algori thm as CLUS726.ITER, but takes 50 x 50 fields - I 

up to 2880 of them, and maps the output onto the line printer. I 

Since this program will almost certainly fail, due to its time J 

requirements, it includes a facility for restarting from an 
Intermediate solution. | 

KMC2.CLUS2880.NAGY LABEL = 4 

Clusters up to 2880* data points - from the 50 x 50 fields - using the | 

Nagy update of the Bonner chain algori thm. Options to the mapping ‘*1 

routine permit the same program to handle also the 726 data points of 
the 100 X TOO fields, and the 176 points of the 200 x 200 fields. | 

KMC2.TW0TAPES LABEL = 5 j 

Examines an area covered by two different CCT's. The tape | 

coordinates of a common point are required , and the displacement I 

between the two assumed to be parallel to Satellite path and scan | 

direction. The tape coordinates of the area to be examined on the I 

first tape are also required. Pseudo-gray-scale print outs of the tv/o 1 
examined areas are produced, and a map; also di f ferences between 3 

the two maps. A histogram of the differneces is also produced. The 
program has facilities for pcrf ormi ng multiple tasks for one entry of | 
the program. | 
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KMC2. LINER LABEL = 6 

Uses a local edge detector to define sharp gradients In the Image. 

The ideal slope can be mapped as a pseudo^grey scale to produce a visual 
plot of high gradient areas. The gradient map can also be ‘thresholded’ 
to produce maps of 'outlines', which can be variously thresholded. cleared 
or compressed. 

KMG2. FOURIER LABEL » 7 

Using the program written by Davis, this takes the Fourier transform 
of a line of data taken down the scan. The object is to Identify the 
magnitude of the 6-line problem. 

A graph of the power spectrum is produced, and facilities exist for 
perfonning this at multiiple points in the tape, and for a choice of 
spectral bands. 

KMC2.CLUS.ERTS.NAGY LABiL = 8 

Performs the Nagy classification algorithm to individual pixels from an 
ERTS scene. These are mapped out on the line printer. Facilities 
exist for the suppression of the 6- li nr -Streak pattern. 

KMC2. BAYES. COMPILER LABEL = 9 

Compiles a data file of mean and eovarianee matrices from an ERTS tape 

from a set of coordinates which identify training areas for a Bayesian 
classifier. The training areas are assumed to be identified a pri ori . 

The output is a set of files to be used as 'training data' for a later 
Bayesian ■classification. 

KMC2.0LDMAP.WASH LABEL = 10 

Performs a point-^-by'^ point cTassifi cation of an ERTS Image using heuristic 
multi "dimensional density slicing. Present version set up to identify 
water features of the Wash area. The output is to the line pri nter in 
the first instance , and then to the graph plotter for final output. 
Elementary geometric corrections are included.. 

KMC2. BAYES. CLASS LABEL = 11 

This program performs a maximum likelihood Bayesian cla: si fi cation on 
a portion of an ERTS image, using as training data the files produced by 
program KMC2. BAYES. COMPILER. Output is a classified map sent to the line 
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KMC2.HIST0.GRAMS LABEL = 12 

Compiles the T28-leve1 histograms for regular 50 x 50 fields for a single 
ERTS tape. It thus needs to be run four times to Gompile the histograms 
fj^ojr an entire ERTS image. The data are output to a file which is usually 
held jn magnetic tape. 

IWG2. TAPE. LOOKER LABEL = 13 

Using the utility routines written by Ian Hill to unpack the header and 
annotation blocks from a tape, and to unpack and list the first Sean Tine 
of that tape, 

KMC2. GRADIENT 

Produces a gradient image from a given ERTS tape, output onto a second tape. 
KMC2. PROFILES 

Plots single scane liines from an ERTS tape to the pine printer , Has the 
option for multiple plots from one program entry. 

KMC2.CGMPFILE 

Goropiles data files for TOO x 100 and 200 x 200 fields from the smaller 
fields produced by KMC2.H1ST026RAMS. 


7.3 Support programs av ailable at UEA . for use on the IGL system. 

MAPPING; Uses the graph plotter to produce a shaded map of a classified 
ERTS image. The routine allows for up to 13 shading types, and the boundaries 
round all non- identical types are drawn v The main program is Set up to 
handle any of the three formats of data, i.e. 200 x 200, TOO x TOO or 
50 x 50 fields, and any degree of Skew. It has been adapted to plot small 
amounts pf pixel by pixel data. — — " 

LISTING: A catalogue program to list, with comments, the current 
holdings of IRTS imagery. % 

HADCALL: Routines to Galaculate the Hadamard transform of a given data 

string using the Fast alogorithm. Intended for eventual use as part of a 
texture investigation. 
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MAXLIKE: Uses the Janies and Janies maxliniunl 1 i^kel llhodd prograin for splittiing 
one-ditnensTOhal (marginal) histograms into two normal components. Ihe 

printed program has been amended to take as its input a caTCulated 
histogram. 

SLINK: lihe Sitson Single link cluster analysis, adapted to aGcept the 
means of 200 x 200 fields. This program is run on the GDC 7600 at 
Manchester, and ineludes an option to draw a dendOgram of the results. 

QUICKPLOT: Draws on the ICL graph plotter, a single |RTS scan line, from 
data stored on punch cards . 


NLMA; The Salmon non-linear mapping algorithm. Released to A. C. Armstrong 
on a personal basis from Imperial College, London. Used to project 
the means of the 200 x 200 fields from the original 4-dfmensional space 
onto a 2-dimensional space. Modified to use the graph plotter as its 
maiin output device, rather than the line printer. 

****************** 


IIOTODUGBIUTY OF 
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Figure 1. Position errors for "iiie dam uall of Honey Lake. 

!Ehe open circles show the position of the dam (marked by the hexagon) 
as fomd from tlie latitude - longitude grid included -Kith the CCT 
annotation data. 




ERTS IMRGE CGNTGURS 


CONTOURS for the V^ASH 

Figure 2 , This contom' map of graj sca3,e values for the sea 
area near the V/ash, TJ.K, gives a good indication of iiie distrihution 
of sedinent laden water. 
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UnBUioothed gray scale histogram for MSS band 7 taken 
from & UTA in the Central Valley^ California* Note how much smoother 
this distribution is than the corresponding distribution for band 


5 shorn in Pig. 7. 
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Figure 5 


Tape Count 
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Tigupe 9* powei* spectra for a strip through the Sierra 

Nevada ran-e. Axes ore as for Jlgtare S. 
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Pigwre 10. Sample output from the hietogran splitting program. 
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Grey scale map of area in 
Central Valley 


l^gure 14* A eomputer prajit out of aii area in. the Central Valley 
The field pattern is clearly visible in this band 7 image and 


an lirlgatiGn canal crosses the south-west corner of the area. 










15. Sharp boundaries detected In the area, showi to Mg.14- 
iiHtoe toe eight nearest nei^hours method. A double threshold is 
used eo that stars represent a larger difference to gray scale ■value. 



Field boundaries usin g 3 nearest pixels 

- -•••- ■■•■■■■•-■ -- -■■■1 ■--- . - . -- • - »— » ■ • 1 - 1 ^ 

in NW quadrant 


Figure 16, Sharp ■boundaries detected using the three nearest 
nei^hotu's in the north west euadrant. ITote that the boundaries 
BXQ more sharply defined than those in Figure 15, and that no 
significant features shovnx in Figure 15 are not also sliown here. 
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The paper presents an outline of a research prpjeci 
usiiiK data from £V{TS-1, It involves reading the 
computer compatible tapes suppUpcl by NASA for a 
limited number of scenes. Extracted from these . . 
are data applicable to a number of smaller areas 
(ITnil Target Areas) of 50 km side and for each a 
freciuency histogi'am is produced for the I'csponse 
within the four wave bands of the MSS system and 
the three wavebands of the RBV system,. . Those ■ 
liistograiTis are then compared foi' successive 
orbits to detect and moi-itcr change. 


1. INTRODUCTION v ^ ^ ^ ^ ^ ;; 

As remote sensing platforms develop and become more remote from the surface of the earth C 

that they are sensing, the resolution of the sensors becomes increasingly important. As the 

altitude’of opcratidii of the platforms has moved higheri from low flying aircraft to high altitude y 

reconnaissance planes, rockets and balloons to finally the oi'biting satellite, tlie ability to detect 

small objects lias for the moment been sacrificed in order to scan larger sections of the earUi's 

surface more rapidly. As many of the world’s, centers for research In remote sen.sing started in ji 

the field of interpretation of aeriaTphotugraphs, norinaliy flown at altitudes of less than 15, OOC) ft, 

and with camera arid film systems wdiieh were fully recoverable, it is hardly surprising that the.: 

majorily of'researclr effort: direct to tlje use nf sensors mdUrited oh:ortri;brig:piatformS shoLtid ■ i 

continue to be concentrated at or near tlie absolute resolution of the system. Indeed some of the 

research proposals; made to NASA for; eo'opet’atlpn: in the ;ERTS scheme .would appear to be beyond ■_,! 

Uiis liriilt Of resolution. Because an drimanhod satellite must: at present, use non- recoverable : ; ' - . ' *): 

methods of remote sensing the resolution ol tlie imagei'y is determined largely by the fact thav it i 

is built up from EcaTming sensors^ : eacb scan Hne being broken up into a large number of units .. i 

kno;wri as instantaneous fields of view’. In the ERTS program the iasiantarieous field of view is of i 

tlie order of a square 300 feet op the side. The response from such an area makes up. an average 
■ response frdm; all the Stmfaces’ ^ t A. ty.picat research project iirvojvrrig theiuse of hi ' ■ - ; 

. altitude'aircraft Avith multispectral scanners might involve sampli.og a number of smali_ai’6aE in . 

_ the imagery and producing spectral: response signatures from, these samples and naatching them to , - 

^' ImovVh sigriatures in 'brdBr to allocate each sample plot to a known type of land use or land surface. ^ r. 

Such work, witii. also visual interpretation of the photographic imagery built. up from the scan line 
: ■ - data, ycoritinues-inmariy. forms with tbe'ERTS.impjeGti; There are t^ ^ 

using this data from KRTS: ■ : ■; 

: . : l .l- it is obviously importarit to a-dapt tlie Mplysis Of. the rernotely sensed data: to doe scale Of the } 

data gatliering system, A Silollite system such as ERT.S is perhaps a forerunner to a worlJ-wide 
data galherv.ig and monitoring systenu On this scale looking at instantaneous fields of view would 
seem rather like looking at needles in hayalacks, 

1.2 The potential volume of daio from a satellite based system is very large, t he data 
acquisitioh rate froTn ERTS-1 alone, vdth its limited i’ccl tlJne arid recordiilg canaMItty, is estimated 

at 15,:ji :bJ picnire elements aweni; for u year irrdc» 0 U.. t.py2b :RepeUUve coverage is. an i.nuoi-iAm 
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FIGURi-: 1 . data fj.ow 


asKCt of a sateHito sywlom nnJ tlitfi'cfore an effective sampling Blructure is difficult to estsbUsh 
and it ie valuai>lc If all lh« avaibtblv data can be Kcaitnt'd to detect change over sncceesiyc orbits, 

2. TUiO OlbirCTIVK OF 'nii; HRSliAHf U PnOJlJCT 

2. 1 With many research pstablislitncnts having a long and dintinijuislmd history in the field cf 
remote Hcnsing they Ituvc [iccmiinhiled nccosnary i upilnl equlpiiient and expertise to continue to 
develop roaoareii (.fCorls uloiii; Iradilional linca. The first objective vvoa to identify a now direction 
for rcscitreh with KutcUite data which would not rei|uire large invoHtmnnls in capital equipment and 
manpower and yet at the same timo would not he repeating experlmcnta completed elsewhere, 

2,2, Hetnoli? sciiKing data have little intrinsic value until they are used by someone else to make 
deeiaions or to facilitate action wliii h beneltts nnin {.Scbn.inipf, T.)72), The second objective was 
to devise a syeteni making remote sensing data more readily avaliable for such ubch. The data 
needs to be available in arc:4 units v. hicli have a BCitlc uppropriMte to a data gatliering system for 
the vvliole world, Ak Hpted is esscivtial, not only to polentinl users but alaf' h<"'ac'-,e of tlie 18 da’.' 
KHTS cycle, it may In; tiesirahlc for llu! visual ins|H ciiori of the photof ipliic uir tviy lo be 
circumvented, and iJi’oduced Oiily for those areas Vvdiich are picked ou'. f,S intpj'estlng by an 
automated data .scanning system. The design of such a system becar te the aectmd objective of the 
research, 

3. Tin; DATA SCA>:\1N0 SVS'fL'M . ' • 

3.1 THD DATA llSI.n ^ . 

The precision computer toinpatil.lc tapes, r:enl,ainlng data frt>m the four bands and tlie 

three HHV bands, or sucli of Ihuse asi are available, are used as the data input (fig, 1), The 
precision data is reijuired because of ibeir greater rndioinetic accuracy and because the scene 
they represent is f-patially regisU red on the basis of ground inforni.-ition leading to a reduciion in 
marginal errors vvlu-n identifying the same area of ground o'^er successive orbits. On the otlier 
hand this goes agBin ;i one of the i>hjL'ftt\'o& of th*- research, in avoid the pholograpliic iinagrty, as 
the precision computer compatible tapes are foriiicd by ecannlng the 9 inch square plates rv-suliing 
from the precision processing. With flHT.S-lt weTmpe to use sorne of Uic bilh data but at this early 
stage of devcloinuent accurate scene registration is esficutlai. The eight computer compalifilc (apes 
are ayailaLile for eacli scene covering approximately an area 100 nautical miles square ami tlifcse 
contuin all the MMS and the HBV data. These tapes have to he read and searched to find the data 
referring to the selected areas of the earth's surface, 

3.2 THE’ UNIT TAHGRT' ARRAS 

The sct.anncrs, by which the precision Imagery t» searched to create the computer compaUble 
taper, have «n jnstantaneoua field of view which is called a picture element, It is the intention of 
thlB project to combine these picture vletnents Into much lai'ger areas (Unit Target Areas) and to 
produi e the average .spec tral response within these areas which can then be compared over 
successive orbits. The size of this unit targev area will be a manor of experiment but it needa to 
be an appropriate scale for mapping large areas of the earth. Inllially the UTA has been fixed at 
50 J«m square and will be so selected tliat each falls within one scene examined by RRTS-1. Two 
snclt unit target nreas^ arc being used in the first instance, one in tin- United Kuigdora and the 
second in Central California, nds second area was chosen as an area vvith less cloud cover than 
in the United Kingdom and as an area where there would be little difficulty in obtaining and 
i-'lerpreting the RRTS-l data. 


3, 3 UNIT TARGET AREA DATA ANALYSIS 

The computer compatlbl ’ tapes are scanned for each unit target area and Uic responsea for 
each wave band for each sensor are arenmolatcd into 13 class inUu vals. The resulting seven 
histograms will be printed using a CUT display unit. These hictogi'ama become tin i>,ois of ail 
later work as they form a type of spectral pcsponae signature for the unit target area. Known 
radiometric errors in the sensers and in the pj'ccision processing can now be used to give conlidence 
bands to each of the histograms. The computer cornpetlble Upes for later orbits will then be 
exiimined and the unit target area data extracted after careful scene matching. This second, end 
all .snttcf.Kivf , response histogranvs can be compared lo the first to isolate clianges in the spectral 
response of the unit target area over time. 

3.4 IDENTiriCATlOK OF cnANG'’S 


If the liistograms shew change greater than the Confidence limits the photographic product* 
can hi- rvnmined together wUh Ground truth to discover the nature of the change. The most obvlcus 
cho).;^>- wiU tio in the amount -n chr.id cover. Changes In dit seven bistogranvs are a response 


sss'.'ssr 




tij.<‘hat!t'< '.4 iit ( loiuJ covM;vr. eari ijt* partly Ijy vyrtipl- jnspertiu-i .tad partly liy a separate 

it.vamiiutiori oi' cloud free sections of the unit target area. Once the iwture of changes broiight 
about ijy t'lmid cover ar** rec<»gniRrtl.h*tnre',;analysss,vl»l fe inyipdi a-calcalatlort ofthe: 

percentap^o of cloud covia* autoiiiuticRlly tvliile looking jqr other .signlfirHnt changts in the 
resppn.ses* tilvould no such clianges be recognisable then the data for ibat orbit can be rejected, 
the photographic etidenee not rcfniested, and the j)n;*t orbit can be rahmared directly with the 
rirst. ' ^ ■ 

1. APPlJCATIONS 

N'o data bd.s been analysed so the results are ‘hose anticipated both In the long and itliO short : 
term froin such a system. Tlds rt-search program is designed to provide a method of analysis, 
not to produce directly usable resuiUs from Ucc data. This is in keeping with Uto expcriinenlai 
nature Of tlie EUTS project; In order to provide for as tvddo an application as possible the 
software is being produced fo; an IBM :iT(.i/lQ3 computer hy.slom usin;' the nine track computer 
compatible tapes. Future ap plications lor Uiis rpaearcii are nuineroua and include the 



,, they ; 

to computer mapping ir various fornisi For some purposes the size of the ...at target area may 
be changed and the computer programs will allow for such .ilteratlons. Eventually an 
envdromncnlal watd) could bo proserved over large sections of the glolte without having to process ! 
the pliotogniphlc produtls continuously. Such long *erm developments fiasunto that future s.atellite ; 
systems have better scene registration 'vithout ape. ial precision proeessiug or that tills I 

procesEing wiii not involve Use preparation of pbotograpblc products and it also asssimes that large? 
scale pi’oeessing of computer mpes will bo possib’e at fur greater rates than at preacsit. Such • 
dovciopmeiua do not seem unlikely along with increasifsg accuracy of the Sensors in use, 
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SYNOPTIC OBSERVATIONS AT A GLOSAt SCALE 


KEITH CLAYTON 

School of E/MrQnnmtal Sciepm^ tfoiv^^^ 

NOR 88C\ England. 

• Ea>’th observation satellites arc kitdwn to provide a basitforinappinB both 
l^rnianent and more ephemeral features of the Earth’s surta^^^ 
so far has been ori. their patentiat for detecting hazards such as floods, forest fires, 
etc., it is here suggested that their role in providing a uniform data base for map- 
ping global distributions has been underestimated. 


I. INTRODUCTION 

OVER THE PAST few years talk of what might be done with satellite imagery 

of the Earth’s surface has greatly outweighed practical applications. To some ejt- 

tent tills situation has continued longer titan we might have anticipated, with ; 

the delayed launch of ERTS 1 and the slow arrival of imager/ in the UK. 1 apolo- , 

gisefpr adding to this class of lilefature, but I feet tiiah despite its volume, it has ^ - 

left considerable areas unexplored. 


2. PHOTO-INTERPRETATION 

So far two major areas of application have been examined; the scone being sub- 
jected to a suitable roodificatioh of convcnEonal air photo interpretation. Details 
ofiand use, land form^ and in suitable areas such clemchts as gepiogy or soil, arc 
interpreted and mapped from the .scene, using either a single wave-band image or 
pciiiaps more effectively sonie fotin of canibined image such as that produced by 
an Addcol vieviTer. In little-known areas new mforniatiori is readily obtained by 
this approach, while even in well-mapped areas new develppmehts that 
the latest maps will be found, or ephemeral features such as floods or savanna 
, bums may be mapped. Most of this infprmatiph is produce by interpretation at 
or near the limits of resolution of the imagery, and the investigator is aware that 
improved resolution would add greatly to the information he could extract. 

A more hovel aspect of this approach is the occasional value of the small- 
scale and large area covered of the imagery for reVcaJing features of such a size 
that they are hot rcadhy apiffeciated Oh conventional aerial photography. The 
most obvious and common examples are the Tccognition of major linear gcologi- 
calfeatures, and anumber of examples are already recorded in the literature. 

3. MONITORING CHANGE 

The second major application of satellite imagery of the Earth's surface is the 
value of its rejpeUtiVe coverage iii changes over time. There is no 

denying the fact that even the 18-day cycle of ERT.8 1 allows the detection of 
relatively .shortTerm change floods or forest fires, as well as the surveM- 

iance of seasonal changes in vcgtlatiou { natural or cultivated) and such things as 
snow coveri Coinparisbri^b^^ will give rise to an impressive literar 

tnre and some intertsting (and potentiaUy useful) examples are already on record. 
An example is the chance occurrence of a major savanna fire in. the northern 
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l^ifl of Soutii Atrica on successive passes of ORTS 1, Since the fire occu^cd in 
the area covered by two adjacent scenes, change aftcToite day was observed Wo 
already have satellue scenes showing inajor fioods, :md will soon add valuable 
records of ephemeral vegetation gmwth in desert areas (poierttiat sources of lo- 
cust infestations) or a record of the extent of landslides la some mbuiitainous 
area after a major earthquake, 

Such successes must not blind us to the probienisof regular sur/eiUahee of 
change by satellite imagery, WcrcERTS 1 technicafly capable of monitoring 
every- scene and returning the data to Earth, even this svslem would produce 
nearly half a million images a year. For fioods or forest fires, coverage on a onc- 
uay, rather than an 18-day, basis would really be needed, and. this would put up 
the animal total of scenes to t^ million. In additioni there are curreoUy four 
iitiiigcs for C3ch sccnc^ further incrcAiiiui; the number of linages involved In global 
surveillance. The tmre taken to receive and took at these Images must also be 
taken into account. If the returns are worthwhile a sy.stcni can be of course 
developed, even If it consists of no mure than a vast team of trained photointer- 
preters, ! have not considered the ailditional complications of cloud cover at the 
scale of real-time gipbal monitoring: we can suppose that new techniques will 
come along to allow penetration of cloud. 

Wlietlier we arc considering a monitoring programme per se or are just facing 

up to usmg the vchiniinous output ot an Earth satClUte system, there is a strong 
case for automaring our initial recomiaissancc of the output. This is the first 
objective of work If; progress at the University of East Anglia j 1 J . It rests on the 
supposft .^ii that, once one good quality scene is available for an area, we are 
only likely to want to spend time looking at other scenes if tJiey are different, if 
they show seasonal or other more irregular changes. If we can monitor the 
satellite output by machine, or by madiinc-aided techniques, we have the chance 
of identifying dupheate (or cloud-poyered) scenes, and so jiiny greatly reduce 
the number of scenes which deserve more detailed work. This is data-reduotion 
by the recognitim.' and elimination of duplicate data. 


4. generalization OF satellite imagery 

"^^fcv^view has covered what I seoas the two main, approaches We have So far 

scenan tlie utilization of Earth resources .satellite imagery. In each case the data 

is reduced to a manageable aiupunt, cither by 

area or by concentrating on change and discarding those features that are fixed. 

It jloesmot seem to luive been appreciated that tliese two approaches do not pro- 

«de information that may readily be handled at the cohtinental or globaf Scalu 

Yet surely wC^wili be fading to exploit tlie most fundamc offered by 

orbiting satellites R wc fail to work at a continental or global scale alongside the 
massive amount o. more detailed interpretalion already undet way. i shall not be 

concerned with the iiscfulness of suchgcr eralization but 

niques by which gldbarsynthests might be brought about. I am quite cottvmced 

that generalized mapping and interpretation at coritinental and global scales is 
most Worthwhile aim, and h,ave no doubt that the new information which cCn be 
extracted fiom satelUtc imagery will open up new areas of rtrscarcll and allow 
neiv approaches to environnientalrelati6n.ships at and near the global scale. 

\/hat we must first understand is the need to generalize and reclassify the 
images we obta;*i from orbiting satellites if we are to work at the global scale. The 
scenes are 0, cciurse generalized to some extent by the scale of the imagery and 
tlilsis reflectml in tlmliniitsof resolution which makes ft difflcult, for cxmnp^ 
to discefn city blocks and of course qidtc impossible to see hidividual buildihg^^^^ 
Yet in sviUable areas, where fjolds are large, iiidividuai fields can be seen, so that 
thJimageryiinksto-someoxtehtwUhomgrouiid-levcFf^^^^ of our 

bit of the world as a inosaic of fields and built-up areas. Even this suggestion of 
a field scale suggests that at the margm ef interprctablEly ihe ERTS I images 
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couSd yield data niat>pablc at about I ;1 00,000. The small ndd.s typical of 
England arc identified on our 1:25,000 maps, but as the pre-war sheets of the 
Land Utilization show, land-use may be njapped at 1:63,360 on the basis o'" un- 
distorted representation of field units as mapped at a larger (1:10, 560) scale. I 
am not suggesting that there are many landscape eloments that might be extrac- 
ted from the liRTS 1 imagery at a 1 ; 100,000 scale, but this seems a reasonable 
esiimaie of the largest scale at which current ERTS 1 imagpry might he mapped. 

If we turn from the consideration of large scales to the world scale, we move 
from 1; 100,000 to 1:50, 000, 0()0, Which is almut the largest seule on which the 
entire woriti might be shown on a single sheet. Tliis scale shift of SOO tunes 
carries with it a number of implications. First an indivirlual EUTS I scene fs 
going to occupy an area Of no more than 3-4 miilimctres square, so that in practi- 
cal terms it will hardly be possible to divide a single scene into more ti»an about 
3 categories. Indeed, even the recording of a yc.s/no situation (e.g, presence or 
absence of a visible area of surface water) will produce a map with as much 
information as a botanist’s distribution map (Fig. 1 ) based on prc.senCe or absence 



Fig. 1. Distribution for two plants plotted by 10km grid squares on a present/absent basis. 
On the IcUAsiJcrtila cytumchwa L. is closely correlated with limestone outcrops, and in 
particular the chalk of south eastern KngJand. On the righ t an upland distribution is tc- 
piesenled by the distribution of laccnm/nt viris-idaea L (Cowberry). (Mxet A Has of the 
British Flora ed. F. II. Perring & S, M. Walters, 1962). 

of a particular plant in grid squares with 10km sides. ERTS 1 imagery is for too 
incomplete to allow global mapping even of this simple type, but it will become 
feasible with some future satellite systems. We should look ahead to that system 
and start thinking about what sort of dhurabutions would be worth mapping at 
the global sea ke, and what data-processing sysi e.m will extract and map the infor- 
mation from about 20,000 scenes (6G00 for land areas only) most efficiently. 


5. GENERALIZATION AT THE GLOBAL SCALE 

A second aspect of mapping at the continental .scale is tire classification of tlie 
information portrayed by Earth resources salctliles. For most of fite phenomena 
wc are likely to map, the stnall scale of tlie ERTS imagery will involve the map- 
ping of associations of element si, and these assoclat ions will rend to become ift- 
crcasingly diverse and hence increasingly gencialtzed as descripti\e units at small- 
er scales. We arc familiar With this in tettns of thematic maps where generaliza- 
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tion and a classification of t)ic data arc both involved when map scale is reduced. 
We have already referred to Land Use, portrayed as field units on the 1:63,360 
scale. The '10-mile* (1 :625,000) Land Use map of Great Britain was able to use 
the same classification as tlie 1:63,360 map, but had to agglomerate .small parcels 
of each category into irregular dots wliose area was balanced to yield the correct 
proportion of ciich land use category across areas about a cenlimetrc square. The 
same app'o.ndi cannot be used to produce a map at any smaller scale, and a land 
use map of Western Luropc at (say) 1 ;^S,000,000 must adopt a new cla.ssificalion 
based upon Land Use types or complexes. At a still smaller scale, such as a world 
map, it would be ditficult to sustain more than two or three .subdivisions within 
the cultivated land category, and the re.st cvf the subdivisions would necessarily 
lean licavily on the major nalurnl types; of vegetation, wtiicli are attractive units 
at this scale because they cover wide areas and yet we can conceive what they 
look like. However, many of the distributions we are likely to discern on KRTS 1 
imagery do not fall into such convenient categories and vre shall have to build up 
complex units that can bo expressed in quantitative terms. 

For some purposes these synthetic maps are effective illustrations of the 
distribution of complex natural phenomena — soils, vegelation, landform, for 
example. But they are at the same time difficult to compile objectively and per- 
haps even more difficult to analyze. My own preference i.s for Ihe compilation of 
single variable maps, either on a present/absent basis as already suggested for 
open water, or on a .simple quantitative scale (Fig. 2). While some of the possibili- 



Fig. 2. Tlie frequency of pits and ponds in Norfolk plotted in four groui)s on a .semi-quantita- 
tive basL'. The peicentrncs refer to the proportion of suniplo arca.s cnnt.iining pil.s or ponds 
(After A. R. Cartwright, unpublished MPhil thesis, University of East Anglia, 1972). 

Jities sesrn rather simple at first .sight (such as my open-water example) we may 
still have a lot to learn from the assembly of tliis information in a uniform and 
consistent way for the whole world. The analagous map of biological distribu- 
tions I have already referred to reveal far more than migiit at fust be expected 
from data that records no raoie than the presence or absence of a certain species 
over units of lOOkin*. I’art of the value will be the consistent nature of the data 
base over (he whole globe (1-ig. .3). Existing world maps depend on a data base 
that i.s unnecessarily comples: and detailed in a few places and almost nosi-existent 
over wide areas. Willi ttie best will in tlie world the compiler cannot eradicate 
these inconsistencies. It is true thai ERTS 1 data will be most easily u.sed in areas 
where ‘ground truth’ is best os'at'tisheiL Nevertheless, •; provides us with a far 
more consistciil data b^se liiati we have ever Iiad b.. 'f re. 
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Fig. 3. Lielitning: a distribution map for tropical f July '^^ee 
derived from con^utcr one Satitu5in;i 

rtictc is some variation q ■ possible lo map lightning 

//fl/Hfc. 232 (5312), 20 August, 1971). 


Finally an important feature of any global mapping or satellite-derived data 
is the opportt.nily of repeated mapping to pick up .seasonal or annua! changes. 
rdiLtmt t£^ of concerning ourselves with shorter term changes 

partly bccau.se of the dalu-hanrlling problems 1 have already 

hccaLe it is important to match scales in time and space. Such guu.rah/.^d changes 
marusefulTy be referred In as synoptic, and represent a potential m world-wide 
manning tlial i.s entirely new. Tlie nearest approaches are in tire p.oneer gcncrah- 
zcd^cloud cover maps produced by the repeated superimposition ot meleomlogi 
cal satellite images or the mapping of parameters such ?iS albedo from a 
mLe"^ satellites (Fig. 4). There is no doubt that they contnbu- 

ted a new view of the global circulation and a new analytical device. I feel it 
would be a pity if the inevitable fascination with the margin of intcrprctabihty 
of satellite imagery led us to neglect the potenUal oi global syntlKsis. 



8fi4KffitfrJsnsss 

latitude, should be noted. (Redrawn (toiuJoumal ofApp/iddAIi’r^orology, 9, I!) 0), 
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A COMPUTERISED SYSTEM FOR IDENTIFYING CHANGES 
IN THE EARTH'S SURFACE COVER 

I. E. HILL and J. R. TARUANT 

School of Environmental Sciences, University of East Anglia, f(prwich, NOR 8CC, 
Engtaml. i 


The need for and design requirements of a rapid data filtering system for seleo 
ting satellite imagery containing significant new information on the Earth’s sur- 
face cover are discussed. One possible method of performing this filtering is des- 
cribed. 


1. INTRODUCTION 

THIS PAPER DESCRIBES a computer based system we are developing for the 
analysis of imagery. from the ERTS-l project. Our aim is to test techniques for 
the automatic recognition of changes in tlie Earth’s surface cover and to develop 
these teclmiques to a state in which they could be incorporated as a ‘User Service’ 
in an operational remote sensing facility. In this context ‘Users’ can be regarded 
as those who would utilize Earth imaoerj' as an additional source of information 
to aid, for example, planning and administrative duties, in much the same way as 
meteorologists use NIMBUS satellite data. A water supply authority might com- 
bine satellite imagery with in situ measurements to estimate the snow cover in 
the catchment area of a dam, and use this estimate to decide whether water 
should be released from the dam. 

Although the computer system had been designed around ERTS-1 and the 
data products made available by N.\SA, there is no reason why the techniques 
should not be extended to any other satellite based remote sensing platform, and 
for tliat reason the discussion in this paper will not be restricted to the ERTS 
system. 


2. THE NEED FOR A DATA FILTER 

Tire applications of the data from an operational remote sensini! satellite v.'ill be 
determined by tlie advantages of this imagery over that derived from other plat- 
forms. This might seem an unnecessary statement but surprisingly little work is 
being done to realise the potential of one of the major advantages of satellite 
imagery, namely regular repetitive cover of large areas of the Earth’s surface. 
Satellite platforms are well suited to monitoiing changes on a regional scale, such 
as, for instance, crop ripeising or the spread of crop disease in a large agricultural 
area. If information of this kind is to be of practical use to administrators, it 
must be available in a compact form snd, most importantly, in near real time, 
both of which imply the need for an eificieiit data reduction system. We are con- 
cerned here with only the lirst stage ot ibis process, an iniiiai filtering stage for 
selecting for further analysis only those images which show significant changes 
hi the Earth’s surface cover. 

The choice of a coiupuicr based ;.:ysfeni rather than a in,^nual one is dictated 
by the quantity of data an ooerational satellite platform could produce and by 
the format of this data. A saiellitc coverinf. the whole of rite U.K, once a week ^ 
with imagery in ten dilfereni spectral bands would require an analysis rate ot 30 
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or 40 daily. Miich of tris inforniaiion will b'.- irrelevant to any particular 

user not only by virtue of its geoj'raphic locality or spertral band, but also be- 
cause it is effectively identical to earlier imatrery. Rejection of images with no 
new infurntalioii requires a '.evil defined criterion for deciding viiat constitutes a 
significant change and rhi.s criterion must be easily updated to allow for different 
fields of interest of different users and variations in the type of change expected 
with, say, season or locality. Pfovided a fast and simple algorithm can be made to 
implement this criterion, a conventional computer can easily perforin this type 
of repetitive analysis on a targe data base. A team of trained )>hoto-interpreters 
would do the selection as well as a computer, if not better, but only at the ex- 
pense of many man-hours which could be better employed in detailed analysis of 
the imagery. 

• A more important reason for using a computer for the first stage analysis Is 

the digital nature of the raw data. Must modern sensors are capable of providing 
digital data either in place -r alongside, photographic images and transmission 
of data from the satellite to the ground receiving .station will almost certainly be 
in 3 digital form. There is little advantage in making liard copy of the image be- 
fore decldin.g whether it will be needed, especially as many of the changes of 
interest will not be easily detected in a single biir.d bat will require colour compo- 
sites of .several bands. In addition digital data permits a greater sensitivity in the 
identification of change as it allows use of the full dynamic rat>ge of the sensors 
with none of the compression of the grey scale produced by photographic tech- 
niques. 

One final advantage of using a computer-based first stage lies In the genera iisa^ 
lion of data to a regional scale. The algoritb.m for identifying changes can be made 
more sensitive to small changes in brightness over a large area than to large changes 
in a localised area, but the eye is particularly sensitive to contrast between adja- 
cent areas and cannot easily compare large areas. 



3, THE data filtering ALGORITHM 

Tlisre is obviously more than one way to perform this initial filtering for change 
but only one possible method will be considered here. Initially we will be con- 
cerned with recognising changes of any kind but the system could be modified to 
deal with specific types of change of interest to the users. 

The region of interest could be a county, a mountain range, an agricultural 
area or indeed any otiier area contained within reasonably smooth boundaries. 

For simplicity the Unit Test Areas fU'l/'.) used for developing the programme are 
50 km X 50 km squares. The boundary of the UTA is used to generate a mask for 
tlie image which classifies each picture element as either in or out of the UTA. This 
step require.s registration of tiie image with geographic co-ordinates but the 
accuracy required for sufficiently large UTA’s can be obtained, for ERTS at 
least, if the geographical po.siUon of the image is derived from tire estimated posi- 
tion and attitude of the satellite. In practice with RRTS-1 the UTA is located re- 
lative to tire latitude and longitude grid provided with the image, but no ground 
truth is u>c(J by NASA in eslyhlishing this grid so the procc.ss is fully automatic. 

The picture elements in the UTA are used to derive a grey scale frequency 
distribution for each spectral band sensor oii the platform, Tire histograms of 
these frequency distributions can be regarded as brightness ‘signatures’ associated 
with the UTA in the same way as spectral sigriatures ate commonly used in identi- 
fying crops. Change.s in the surface cover alter the shape of the histograms and 
thus can be monitored by comparison of each histogram with those of earlier 
passes over the UTA or even with a generalized .signature for lire UTA obtained 
from several earlier passes. It is expected that with experience .specific types of 
change will be recognisable by considering the associated changes in several spec- 
tra! bands, while multi-dimensional grey .scale distributions for combinations of 
.spectral bands, will provide even greater sensitivity although at greater expense. 

The advantages of this method of comparison over thi more obvious point for 
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point comparison lie in the huge reduction in the number of decisions to be made 
and in the generalization of the data over a large area. The major disadvantage is 
the possibility of sigiiificniil changes being obscured by larger but less important 
changes, For example, cloud cover over part of the UTA could well make changes 
in the unobscured part undetcefabie by this piotcss although still readily detect- 
able by otlier means. 

Comparison of the histograms can be done by eye or automatically but in 
either case the process is oiie of lookmg for cliaages in shape or position of the 
histogram greater than tlie limits imposed by the system, These are: 

(i) the accuracy with which the UTA i.s positioned relative to the 
imagery, 

(ii) the radiometric accuracy of th? imagery. 

The first of these depends on how well the position and attitude ( ' tlie 
satellite is Icnown, the geonielric aocui'acy of (he sensors and projection effects 
and for the ERTS-I mulli-speetral scanner lhc.se amount to an RMS error of 
approximately 4% of the area of a SO km squiue UTA. This is probably an over- 
estimate as it assumes none of the error is systematically repeated with success- 
ive passes over the UTA and that all elements in tlie UTA have the same position 
error, Tlie radiometric accuracy of the imagery depends on the accuracy of sensor 
sampling and calibration but also on atmospheiic effects. For ERTS-i digital data 
the rddiomelric accuracy of the imagery is 2% of full scale brightness but this 
docs not include atmospheric effects. It is not yet Itnown to what e.xtent atmos- 
pheric effects will affect tlie comparison of the greyscale iiistogtams but it will 
probably be nece.ssary to include a correction for the atinospiiere in an operation- 
al system. These estimates of the limitations on the system suggest that for a 
single band it .should be possible to detect either a large change in brightncs.s of 
at least Svl of the area (c.g. by cloud or inundation) or a change in brightness of 
a larger fraction of the UTA by 4% of full scale brightness. 
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4. PROPOSED TESTS OF THE SYSTEM 4 

.A 

The computing system is still in tlie early stages of development and will require -X 

extensive testing before any conclusions can be reached about the usefulness of i 

this technique. A test area in central California has been cho.'jen which includes | 

terrain types ranging from agricultural land in the San Joaquin Valley to the I 

peaks of the Sierra Nevada range. Digital data for pairs of successive passes over 
the area in each season will be used and siiould provide both short term and 

seasonal changes in agricultural and natural vegetation for testing purposes. In | 

addition it is hoped to use imagery taken over Fast African savanna regions I 

where grass burns wilt be used as a source of change in the surface cover. i 

{Presented at the Symposium of the British interplanetary Society on ^Earth | 

Observation Satellites’ held at University CoUege f-ondon, JO-12 April 1973) j 









47 








CFNERALISATION OF ERTS DATA FOR GLOBAL-SCALE INVESTIGATIONS 

I.E, Hill, A.C. Annstrong ami K.M. Ctaytuii 
Ujuversity of East Angiia, Nonvich, UK 


ABSTRACT 


Inroniiatiou contained in ERTS imagery is of use 
’ f: many spatial scales. To achieve a global scale, 
:i’ ,aiis:ition to units of the order of 50 km x 50 km 
. necessary. 'J’iris involves problems of both data 
■Ttrt.ll and efficient and meaningful generalisation. 


A system for the computer-based processing of ERTS 
digital tapes at this .scale is discussed. Some results 
based on analysis of the grcy-scalc histograms are pre- 
sented and possibilities for future developments are 
discussed. 


1. INTRODUCTION 


lids paper describes a computer-based system that 
‘C, at the University of East Anglia, are developing 
’■•f the analysis of imagerj' from the ERTS project. 
!>h* .starting point for our work is a belief that, 
.f d.iiugh conventional air- photo interpretation tcch- 
Ti.ines can be, and have been, used to interpret 
I' croicly sensed data from orbital platfoiins. the 
nicrences between the data obtained front aircraft 
from satellites warrant the development, or at 
1 -ist lire te.sting, of new methods of analysis. Tlie.se 
■V methods should be designed to work in parallel 
"ith. rather than in place of, conventional analyses, 
’Uh the aim ofc.xploiting tiie additional information 
i Tstaiiied in the satellite imagery. 

file advantages of satellite platforms over aircraft 
! ijitorms are their ability to provide (i) repetitive 
• =*er;!ge at regularly spaced points in time and (ii) 
fiohal or regional coverage with a tiniform data base, 
-1 ereas a major disadvantage of satellite platforms is 
•i'* limited spatial resolution obtainable from such 


altitudes. Repetitive coverage provides the opportu- 
nity for monitoring changes in surface reflectance, 
due to cither transient piienomena sucii as flooding 
and burns, periodic effects due to seasonal changes in, 
for c.xample, vegetation, or long-term changes, sudi as 
urban development, forest clearance, etc. The poor 
resolution and the global coverage together imply 
that satellite imagery would be more usefully used 
looking for changes on a large spatial scale than 
changes involving areas near the limits of resolution 
of the sensor. 

The work described here has accordingly been 
aimed at developing a system for monitoring s.UeiIitc 
imagery for changes that are significant on a regional 
scale. Probably one of the most important uses of 
such a sy.stem would be a first-stage data filter for an 
operational Earth-observation satellite. There it 
would be used to select, for further processing, only 
those images that differ significantly from earlier 
imagery of that area. 


2. TECHNIQUES 


beveiise it is proposed tiiat the system be used as a use of data that has undergone the minimum amount 

n.5i-5tags data fiJtef, it is important that it ca!,' nta.kc of pre-pri'cessing. It is probable tllt.t any salclHte plat- 
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form Tor uioiiitnriii!; tlic I'anfi’s surl'iice t)ti an opera- 
tional ami repeiiltve basis will return the data tn 
Kaitli ii.s a diyiliscd pieiure ami, I'ur tins retison. a 
'’omjHJtcT-bascd system has been chosen Otb/r 
reasons ibr the use of disused data are llie anility to 
make use both of the sjiectral iitfomtailon provided 
by mullispcctra! scani.ers witiioitt producing colour- 
composite |diolographs, a slow process when com- 
pared witii the dala-acquisition rale, and of the full 
dyntimic range of li<e sensors, willtout the degrada- 
tion in radiometric fidelity introduced by photo- 
graphic processing. 

i•or this project large general-purpose computers 
(an IBM 370/165 and an ICL 190311) have been used 
and the program.s have been wrillcn lit Fortran. While 
a large computer and a high-level programming 
language are convenietu in the experimental stages of 
a project such as this, it seems likely that a smaller 
special-purpose machine, perhaps with some special 
hardware functions, or with some analogue proces- 
sing, could be more efficient for an operational 
system. 

The first stage in the analysis of the imagery is 
generalisation of the data to a scale appropriate for a 
regional survey. Each ERTS tVanie contains 3 x 10’’ 
picture points for the four MSS bauds, whereas a 
regional map at a scale of, say, 1:10'’ could show 
10" - 10"’ independent data points in a equivalent 
area. The second stage is the recognition step where 
the data is compared with that for previous passes 
over the area, and significant changes are identified. 
Note that in most other eomptilL-r-based systems for 
analysing remotely sensed data, these two steps are 
perfortned in (he reverse order. For example, classifi- 
cation into surface types is usually done using the 
spectral signature for each picture point, and generali- 


sation to large areas (individual fields) is only made 
when the I'l jssifivalioit icsulls- arc displayed. It is 
imped (hat b\ suii.;bl,; choree of the area over which 
thisiniii.il i- iirade, it will be po.ssihle to 

mask the cffecis ol large changes taking place in a 
smtili area, and sn obi.tirr a .iystem tirat is .sensitive 
only to clranges sigrriflcartt on a larger scale. For 
experimeiila! purposes, L'riit Test Area.s (UTAs) 
50 km sqitaro have beer) chosen. 

The simplest possible form of ge tie rrtiisa lion is to 
reduce the data for eacli LIT.A to a single luimhcr for 
each spectral band; this mimbcr could be tlie mean 
radiance, in which case the generalisation is equiva- 
lent to reducing the resolution of the sensor to match 
the UTA, or it could be some more complex para- 
meter. For most purposes, this would be too drastic a 
reduction in the information content of the data, 
although this level of generalisation could possibly be 
used to recognise areas with a higii degree of cloud 
cover. 

The Ibrm of gCneralisirtion used in tins ex.periment 
is to characterise each UTA by the grey-scale frequen- 
cy distribution for all tlic picture elements within the 
UTA, The parameters olnaincd from this generali- 
sation could be cither the grey-scale histogram itself 
or quantities that can be derived from it such as tlie 
mean, iialfvvidth. skewness or number of modes. 
Initially the histograms have been used, but one of 
the aims of liie experiment is to determine whicii 
parameters can most usefully be used to monitor 
changes in surface cover. 

TIic histograms can be compared visually, or by 
means of siatislicat tests. In the latter case tiic Clii- 
squared and Kolniogarov-Smirnov tests have beer- 
used. 
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X PRELIMINARY RESULTS 



In the previous sections, the aims and lines of 
investigation of the project have been outlined. 
Regrettably, it is still too early in tiic experiment to 
say how effective this leciinique will be. but in the 
remainder of thi.s paper some early results and pro- 
blems will be discussed. 

The area that has been ciioscti as a test site is 
covered by one ERTS frame, and is sliown in 
Figure !. Tlie site covers part of the Central Valley in 
California and extends as far as tlie coastal ranges in 
the 3ouiliwe.st, A large area of the Sierra Nevada range 
is included on the eastern side of tlie frame. Digital 
data on computer-compatible tapes have been obtain- 
ed for two successive passes over the area in later 


summer 1972, and a third set of data has been obtain- | 
cd for spring 1973 In addition, a secomi test area in 
central England was selected, but repetitive coverage | 
is not ye! available for this area. 

One essential requirement of the technique is that | 
the grey-scale distrihutton in unchanged areas is suffi- | 
cientiy repeatabie for changes in other areas to be | 
recognised. Figure 2 shows tlie histograms for all four % 
bands for a UT.A coiuaiiiiiig prcdomuuititly agricul- 
tural land in Central Valley. The only changes visible % 
on plKitograpliic prints oi' the imagery relate to a few p 

fields which logclhermake up V/c or lest, of the UT.A. || 

The liistogranis have a similar shape for both scenes 
103K-IS114 and 3056-18114. but there is a small ^ 
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f r^ufe I. The lest site in central California. The Central Valley from Stockton in the north to !'re.mn in the 
south is inciucted in the frame. 


'i'tft to lower grcy-scale values lor all bands for the 
‘■•K’r pass. This sInTt can be accounted for by the 
'decrease in sufar irrailiancc associated with the 
•'Scctcase in solar elevation. Table 1 shows that the 
‘■'0 uf mean radiances is comi'-aiable vviih the ratio 
tile sine of the clevatifm anjile. 

Aitliougli tl’.ere is very gc'od agreement between 
die large-scale structure of the hisiograms. particu'ar- 
'or band 7, tlrere are many smali-ei features which 


are not duplicated. The histograms for bands 4, 5 and 
() lend to be 'spike;, ', am! even in die baiul-7 liisto- 
gram there is a smalt spike near the peak of the distri- 
bution. Tiicsc spikes arc not real structure in llv: 
grey-scale trcquency distribution. Inti unginalc fruiu 
the mapnin” of the sensor grey M-aic witli iis f 1 
values to the v ('1 grey scale winch lias 128icie!; lor 
b:mJs4. .5 and 0. aiui 64 lewis for band 7. Hecause ol 
the nonlinear uansi'er function and expanded t < T 
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Table 1. 


Band Mean Radiance fbrllTA from Scene 
iOiS-lSiW 1056-18114 

Ratio 

4 

18.7 

17,8 

1.06 

5 ■ . 

18.0 

17.0 

1.06 

6 

24.8 

23.3 

1.06 

7 

26.0 

24.8 

1.05 

Sun 

Elcviitioi) 

52“.l 

47“.4 


sine (El) 

0.792 

0.736 

1.07 


Figure 2. Grey-uale hhtograms fora UTA tn the Cen- 
ira’ Valley. The open Insluff-ams ut-c derived 
Jiom FUTS H’cne lOM-lSI 14, and the 
shaded histograms are from 1056-lSJ 14. 
The horizontal seale for all bands except 7 
is divided into 6‘t grey levels, each of which 
corresponds to two CCT grey levels. 




grey scale for Lands 4, 5 and 6, ami because of the 
radiotnelric corrections at>piied to al! bands during 
the transfer, there is not a one-to-one nrappiiig of 
sensor grey levels to f'CT grey levels. This is illustrat- 
ed by rigure 3, which shows the grey-scalc frequency 
distributions for successive groups of six scan lines 
down a strip etne quarter of the image in width. The 
features to be noted are the blank columns correspon- 
ding to two grey-scale levels for this band-5 data 
which continue through lire distribution fur several 
successive litses. The calibration process changes the 
ruipping from rensor levels to tape levels as the seiisi- 
ttvity of the sensor changes, and over a sufficiently 
large nun i her of lines this tends to smooth tire 
frequency distribution. Alternatively, lire histograms 
may be sniootlied, by nso of a suitable weighting 
function, to give about ,j 2 independent .grey levels. 
However, lire smootlitng function for bunds 4. 5 and 
G is dependent on the aetuai grey level and results in a 
nonlinear relationship between Lipc grey scale and 
radiance. The effect of liie noidinearity is to weight 
changes in low-tei'lectriticc- surf.ice cover tmue lieavily 
than clian.ccs in lire upper half of tire radiance scale. 

One |iart of the conijutter system which has been 
extensively testeii is tlio segment that aligns lire digi- 
tal data relative to the U1 A. flic contldence that can 



•_ r ^.i .. ..r. 




Tignrc (,rey\Si:dr iu.itograms jor suveessive groups 
of .six scan hues. 
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be pliiccd in cliangos in surface rctlectance dcteUctI 
by this system dcpciuis on the accuracy of this align- 
mem. I’or c.vamuic, witli 50 Jmi sqniire UTAs ah 
r.m.s po.sitional error of i fair pltices a li«ver limit on 
tlie iVaction of the iuca fm v’likh cliimLics can be 
reliably detected, of above 5% of the total iirea of the 
UTA. The proposed computer system uses the lati- 
tude and longitude grid mfpi'Euution iiv ttic annotiition 
record of (lie CCT to convert the co-ordinates of the 
corners of the UTA to tape co-ordinates (scan line, 
picture cieinenO- The pre-launch estimate of the 
accuracy of this conversion was aboiit^ ^^5^ nv r-m>s.* 
for the tiircc scenes for which tests iiavc been made, 
the errors arc in the range 5—10 km. Figure 4 sliows 
tlie positions found for tlic dam of Honey Lake, Cali- 
fornia, from the annotation dat for scenes 
1038-18114 and 1 Q56-1 8114. Tliis error is,sufficicnl- 
ly large to obscure any changes involying less than 
about one quarter of the area of the UTA. and so, for 
most purposes, is large cnoiiglr tb prevent this moni- 
toring technique from working. 

The UTA could of couhe be positioned more 
accurately relative to the digital data by use of several 
control points on the ground, but this has the dis- 
advantage that, for an opci.ilionai system working on 
a world-wide scale, the iiombcr of Coiitrol points 
would soon become unniahagablc. The ideal solution 
for an operatiphal satellite platform would be 
improved attitude and position iiifortiration and con- 
trol, but this might not be tcclmieally Tcasiblc. One 
allemative method, which is being tested, is to align 
one imago with the ground by manual niethodsand 
tlien to use tlijs primary image aS a correlation mask 
for successive passes. 

As a final example of the sort of difficulttes that 
need to be overcome before the computer system can 



Figure S, Greyscale histograms far an area in the 
Sierra Nevada range. The histogram for 
scene 14 (unshaded) is plotted 

with only the xmsatumted part of the image 
used in caleulating the fteitueneies for each 
grey-scale class^ The greyscale classes cor- 
respond to three CCT grey levels. 

be used operationally. Figure s shows the effects of 
clouds and cloud shadow on the grcy-scalc distribu- 
tion. TIic histograms arc for the same part of the 
Sierra Nevada range, but in one case the area is 
partially cloud covered. This produces a sharp peak at 
hig!» radiance, as e.xpected, but ill addition the cloud 
shadow, and structure within the cloud, arc sufficient 
to make the renrainder of the jiMograms totally dif^^^^^ 
rent. The implication of tins is that cha^ ill (he 
cloud-free area cim oidy , be detected if some spatial 
inrorination is used to remove the whole cloud-afFecl- 
cd area from both images before prepariiig the histo- 
grams. 


4. CONCLUSIONS 


The results presented above tend to emphasise the solved. Otlier work prtveeding in parallel wilii 

difficulties that have been met in developing tlie com- described here docs, however, give some indication 

piiteriscd system, and give a rather negative outlook . that use of grey-scale histograms as a method of 
on the usefulness of tlic technique. At lids stage, gcnerelising tlie data is a satisfactory solution. In 

howcv.-!r, much development is still required, and no particular, it seems possible to use the histograms 

tests on the effectiveness of the system will be o j,ssi- alone as a means of dividing the image into surface 

bic before difficulties of the type dciCcibed have been types by using .standard procedures. 




GBJECriVE GENERALISATION USING ERTS IMAGES. 

by A. C/ Amstrong & K. M. Clayton, (forthcoming in: Environmental 
remote sensing pwctices and nroblems . 1975)> 

1) omcnvsB 

/ ■ 

An orbiting luatQlllto ohaerratoxyt ouch og' pjcpduoos data 
«hpeai {i«alitl<» differ fnon any pncRTlomly avaito 
ot Ilivs Ihan of consrentlond ai»>^xnii ^ 

and thus it in dif^oiAt to aRF^ oonvtmtlonai te^ 


Aatorfe«Ktatiro Goapansatins^ for liiiii lorer roaolntioii is 

O’ttelXi^ potohtioily effort 
Worldn^ C9rcally» auch a eyotea prcduooa on enoraoua quantity of 
data! a 4\loIc^^ ehoua t the oapabUit^ of 

filling lO** rcolo of tasaotic 53ji^ 


iUo of any ep^proaoh nhich allows a redostich in volim of 
tita data* vnco a roduotion in the volixoe of data is aobi^ovod it 
b^craeo poeoiblo to take advontaso of &d eohaistont data^ 
for crlob:il Invcotlsaticns* 

Iho approach US disoaos ttahos toe of Ocaputor^ <^^ 
technician 1x> aoMcro this data 

Kultlopoctral Scanner aO Its data 8otiroe» it atteiaf is a 
mppiag of -{ha car1& aurface on an o'Daocfeliia feslSk Sheh a 
olasslficatlon would 'ca distlnot|. both in ob^sotLtity / and in sca^^ 
from nny'^^ai^ posnlblo hafore tlie advent of rasote eensins satdllites* 











S/t 


2. Chnrnet<>yi oti.co of KRTlt df».tt]| 

Zii a i3un<^ynchromv9 crHt at a hai^^Ut of 9X0 ^ 

X4 orMts a f t» rvovldo 

prooiaeXy^untroXlod ro]^tltl70 ijoa^tory aenoln.? syoteao oa 
board ora a Uottu’n Jktam VXdioon Caiiicira (rwDir) and a KuItlepeotroX 
Soanuor (nrjs)* l>o uro oonccruo'l \?lth tuo lUiS data aXthoi^^ ttua 
tooiudiitiaaydi^^^ bo appxiad o^aaX3^ to fho Riuv 

othor auXtispeotral di(^taX data* 

She nS!? ao^o an anj^Xo of 2«89^ in a diroctloa 
3»orpe^iciiar to tliO oatoXllto path, tho fortjaxd laotion Of tho 
aat&XXito onawslos to South acaa. !i?ho syotca prpduooa 

no inavsea dtajius too South to north half of each orhlt (which 
is in darlcnoQs)* Each ecan lino in XS9 in Xoneth and too 
eontinuoug strecn of data io eplit into secticrio XaS Ka long, 
called -*$oonoa*'^- 

Tte rovratloa of tho Earth benQ^th the satellite Ijspai’ta 
ft biccKf on the imaeo which verioa iisireroely trilth latitudo* Por 
Calif orcian irO t»© toe skew is 5° so that sucesasivo 

scan linos fixa displscad to toso loft hy about 20a. Por aost 
practical applications it is conarcnicat to nogloct thla £ikew, 
aoBvaains a reptanijitlar array of points for analysis, and correcting 
for toio show in the final dlapiiy otagos* 


She hSS eGiises too rofioctodradiaticn inf Ota? spectral 
regions, conventionally oollod bands 4 to 7 ( table •l). £hd 
noainal iniitrintanaoua field of view is 79-i, sltoouvh thar gystea 
la Capable of dotecting smllcr cb jpeta, gaoh scan iica is 


JWTCd on 


ai. 


1-1:3 roaoluticn of top is 


thus 


coarser tlian thiit of 


OSnircQtioml alrborna e^^lSing ej-otens* V 



lllWtlODnGBlIJrr OF THB 
WtOE B MOB 


Tho data oro oado avollatflo cs photo^lpKi^ihio Sxiist.(pa or 
dlaitlaoi aa Coaputar Conpatildo 'i^apso (CG^a)* Coccuao Of tb? OapeMli-^ 
of nuDOjficaX liX'ooooela:; ta icrwrida ob^octlTO isoxarallaatioa, €» GC'fo 
uoro uaod co t!io iwiafiiy diita Bouroa* 

Each ccena 4a CC7 foiw coatpijao 2340 ecsa 44&oa onoh u4th 
3240 piolrara oloconta (sisolo)# !2!li6 f oup opaaiTnX bazdo 
Intorlcavcd, so th.at «io ciatlopaatral liiTora-tio^ cbjo 

point 4o rrrdlablo at ona point 4a ti-so top©. Scan lin»© nvo 
split iata four ©PjUal i^ovtiona, cad placed on four Dse»»tio 
tapes, «xtia div3.dia^ ercU eceaa 4sto ftxiJ? i:c?t;i*S6at!i a fed pa, 105 n 
4a J&S. fe* 1). 

A fttllci* tjGDGJpip'Soa of tji© oystaa cua bo foixai in 
ilA:*A (1971) ard of tba CO? in aioaao (1973). 
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2). G!>A:ji 2 pic vricw racc.iDU.iAo i:i p.;;.ot.i s^iwq 

iiocaURO or Uta aiao of tho dita sota involved, renoto oe.ijln? 
ap::lication3 h.iv:i uogd oiily a liiaiiua, nisalwr of t;v:j aaiy clrv;aif ioatd on 
eti’ itQjioo ;iV-iilabio« T;icso proeedurc!.'} )uivo iioon laoi e ji bo 

c 

concdiiarod iniUilly in tcraa of too fiualoaBat.;! dichotonioa, l«tAfoou 

■ 

tiio £jUf»orviaod ruid ici3U?orvi>ed, and b^toeou uad ;Oi^*fioli chr:‘:,1 

(, 

?ho j^roatoafc effort no fur has cons into ; or-ioint elr-selfic-tion 
i.rcceciuroo, isiiicii oecd data polaa in taru, and uiti:er aaoi{?j it 
to too nearoot c.lnotor, er cljssify t!io t-’oint by rii-plie- tion of an 
d •-olaiun fimiition, thua Iji'H'-ki: th? o-i.-.t.al ?u'r;:n-ci.v 3 nt 
In r-aad r'cnotaly iicnnoi * t:.. . ra-ricll c‘‘ ;aai.fic .ti:;n 

rr*O 0 '>:ln?en, on t'-e othr? t. vC'';'-: of tbn c‘nr .•.• S’r5.stiCJ 

of fiv sUt'!, ;na atiOnnt tc ciiacif" :vj‘: iriliviau 1 ecintn U?t 
of ijeuvl-ucun voJnta, or ’ri^Ida*, flcl.ln c.a bo 

ti3ct;.a<0.ed' oif'.or ly cnc'ininitioa, of a a d in, ox' uy fieri .•nraiia.iy 
r.yoloa* 

fio a.' -! t:f r:i:::orvlncd : rocodiar 'r a inti;: fr:a c .rlicr trer': ’-rdsb 
to C’taloyufia c-i‘ * a r'ctr”.! tuxor,* , ^ O'-over, 

»_i . ' i\''. V 1^." G i b 1-*r * * w ' G ' ' 
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caonj: t!i0oo ocheaoa ia tho I'a^rosian classlf ica? odoptod 
tjy LARSY3 (Llndcnlaub, 1975 )» ^ftich reoulroa tto ootlmtlon of tte Doaa 
vcotor and vaidanoo-cGrcrian.co laatslx for C3.ch land uso tgrio* Thoso 

c 

etatiotica aro calcuial^ fron a trainiiVT eot;, A aecond s«aplo tea 
uMoU cJ^oujid truth is also avcdlublQ ia deai^^tod •tostinc oat* 
iho performneo of i2ia claoetficr on t!As tost data ia than 
ao&ooBOdf ood tliio io teton aa an estimta of tha porformaiuxj of fha 


olasalfiar oircr i^io vhola d?.ta oot* 


Statlstieallyt tliis proooduro ia fri'ausht \dth difficulties* 
2rainin^ sata are oftan coapiisd aybitrr.rl7^, in riolatioa of noraal 


BoapUrii; aasvJiptiGrs, tliorehy jdelc'ins incoaroct eatinateo of <ha 
population p.'u?a*iotarG» r.na f.ij.V.soqu.ontis' poor pcrfox^naco of t-io 
eirisaificr (Bgsu crJi. caeil 1974). l urtior, both ^rrdnirv? end toatins 
eats aro liouully coapilod fron areas 'kjljaro land use are clotrly 

defined* liioi’eaB larco eectlona of ^i^3 my trail bo conoid erably 


less cleiT* 


BGCauco of t;'jO{)Q stoti i' 'aal probloos, Ilocsr et ul (l97l) ar*-3Uod 


it is sore efficioat to elaeoify tkj data initially, and then to 


collect c^av:^ trutii foi* oacii cf the cl oas preduGoi* *'iio alteroa tiva 
fcCrcj? r '■’-‘'<■'■^1 dccioions cni loco .c.ddit5.oral groun'l-'baced 
iafornatiai, tl^roby boias efficient in er.pl<»atory 


Inveo ti /Siti eora * 


•JBdc inireotl.'tatien irass a por^ficld, unc^RQr'^^.s^d irocoixTje, 

Tha use of a per-^ficld approcch oeonad to offer tho -creatset poocibility 
of cC3ioraUcati.Gn, :nd por,:::Lta a varyln.: iov*:! of ly 
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The clnnsificatlcrti procedisro usodl la thia imroatlgatioa InvoXvos 
ttiTca data filoa, cad tlireio daoioioa Woslaj (Fis 3)* orlsJ.nal 
data oa tia CC2 cupi>Uod by lUSA is reocnnlloa into a scseiea of 
uXozetot th 3 1‘lcXda* J^om tlsee ficido, Buaa?.xy laaasiirea ora 
conpnted, Frodxicing a foatoa flic f^ov eacU fiold ;;Mch Is input 
into tto class if ioatloa alsoritln, Gxd tto cad result is a map 
of Iho areaj Utb oatfii field assisnod to a cato^prjr, 

5) Pli^LD SsiliJSSlOH 


It is oWicTiiGly unrealistic, esnd in teras of tliis irr;aotisatioa 
uanocossary, to elcsaif y individual pixels* Since our aia is 
Caneraliaat'.ea, it ia ccnvenicat to reduDS data cat at tiia 
stac^e, \rj eor:aidorin 3 fjeoups of pixels,, tlie fields iMch can thoa 
1)3 cla 30 if.<ei. i’iolds t.-ere arbitrarily definsd a§ a s'jiiore bleak 
of piKols tains tapo coordinat3o, rcforrics to p^ujallolojsraa-ohaped area 
oa tba sroir/i. iTvla arranpensnt is CGnputaticnally convenient, and 
alao in line uir’i sebemes used by O'iicr workers far per-flcld studies. 

It ia furthoi’ easy to iasroaao the IcV'i'l of c^-'^^ralisaticn ty 
eaelsiuiiatir-j thoee fields, :iliiicut reirjrnins to the orisirul data 
soi?rco. 

Cover; 1 f-ctors ir,f.Ti^or.co tt;3 cdcc of field tv.-.t Jn appropriat). 
tZho rcsolutita of tlio roq[i;irsd clxs&i’i cation die totes a eeadiiua 
slr.o cf field, r.vA tlio resolution of tho sensin'; oyatca in-ossn a 
Icrtjor linlt. The ecalo of the csslyris trill nf feet the slr.c thr t 
ia apprv^riaxe, for ia General tUo iiercer t!ic area to bo ana lysed. 
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Thi .5 looda to •%a practloal ccEKj:ldoi*atioa of coaputcr otozaco 

lihlla tliia factor ia not irtiopcndeat of tho oli'^snlflcatica olga^ltlsa 

uaod| thoro is In caarl^ ©very caoo on anj»r Unit on the niabsr of 
% 

data pointa tfeat con bo hr-n^Jetl by ary onj yrosir.w, and oven If <dil3 
opaco la not a ll-iitins faots:;r it lo l:l!-o2y that :r.cliina tioo vIU 
beoeao a llialtin;; factor i;iotoad* 


v.’i^ thooo Gonsldereticao in nd.nd.> rjid nofcli?.?? (ho oxisinal pnrpeao 
of tlaa invostigation, tiiree sisao of xforo ueed (table 2). iho 

br^pjiQ field VRS trfion aa a siso of 100 z 100 pixels, yioldinj 750 
ficlia to ci:r3 econo, in an array 23 dcnai m>.i 32 I'.croao, '^ioeo 
bacic fiolda trero tiria cjra}f;.ui.vatod to i:i*x'd*dco larrvr fields eoatalEtj 
200 X 200 -ixcla (16 aeroce by 11 do;:-i}. i?or jr agnatic reasoris 
those f is Ida troro not oshaosti'i'o of ^ original iuo^o. Ihns the 8 
fields acroGD cao'a CC‘f (3/4 i:::-go) loft a n..;rrc;i; etr.in 10 
vide tiuit t;as not eGirjiderc:U xiio ir.Gro: cea creed of r-roceaa.ng 
EGined 1^ roatKlctang Iho field ocaipilatien pro£rrxi to a oiiTglo 
GOiirss o:ignotic tape uas die in. ortrvnt factor. ai:J.la-i*ly OiJy the 
firat 230'3 of die 2'3-iO oc:gi li-'.an vero rgol. llio cli'UJtcrinn 
algori'Sa tiua dorit tiitb osiapiote iicide only, no tk;i^ ecale 
problcris that lalglLt ariao \:lth incenplo le iieldc t:ere avoided. In 
fu'baro, for c: eir'.‘I~Lc-rr.l im’crontrtlon, it vo”.ld rrobably bo rrofC'rabJ.o 
to rido-t an r.rrar-T.r.rat that locra cc-spio-^oly'fittod V'o tapeo, in: in: , 
for oxtonpio, fiolda 101 z UV pixel o in ciso. 




St 





a«> oat of 176 XHolda could oaoUy ba handled by a hlotarcJiioal 
clustorii^; procrrji, hut such 4 alscs?itlia could not wujlJy haodlo the 
laTGor ost of 725 fiolda, f© vhioh a difforeat, (itGr^itivo) eluaterias' 
tsohniquo was innlonontod* 

Soaa xosulta mxe also ohtaimd ueius a srallor dold, 50 x 50 
pixels ia aiaot producing a total of 28s0 fiolda to aa EUEJ eoem in 
an array 64 across and 45 dotm, The also of this data eot noant (hat 
far f©^;or cnalysaa could ho jei'fonaed i\sln5 this sot* Kowotot, using 
all >(iu?ea sots aiid a coesaon al?j;orl(24*4 it waa posalhlo to imesttgito (2^ 
effects of dlffarout Hold siso on (ho acliietfod eenoralisaticai* 

4) FKvrunii •^j'-Ji.i^TiojT 


Any single field contaii^io a vast snouiit c? inTomaticaii it is 
both ln?o^T3.blo^ and uaaocessory to ciansisy fields using all this 
lnfomtuti<n» -Peatiiro solectLcn is tlio rroceas xheroby uo sussariso 
this vast anoy^t of iaCosEatioa and rcd’ace it to a atiailsr ncro 


maagoablc net, eiinimtiag aa euc:i of the rcdiuidant info ram tion 
ua possible, xrliilo rataiKlng as nush of tho eosentlal as is required. 

Obriougly fs?oa any Jfjr {,*9 field any nr'Jibor cf fcatmos esn be 
C03iait*‘a, ond tbe o!ioloa E,ur:t bo ste cci'io cn:t 3 at arbitrary* Several 
tooliniquoa ealat for dealire xdai Iho soiectLon of i-jx optLaun sid>-s©t 
of foaiurcs, doTOloT-cd larqaly j?or celeotJ-ns ‘tbo l»at eorabinatica of 


spcol-r-l i :-!:: a ror**: oint cianririlca'Jicn cf nultinpoctral data 

(oci.S-.r-in C: ^i ^^7 Xn -t;’:*'! n(a;dj t!^? ueo of ci*ch tecli:!: lie's 

is ii'^t directly ap' llcabXo, r-liioo t-erjo is no cOr-nrly defimd set 
fren ’Mac.'i to icr:;' an Gp’iioijai G'ub-sov, oiit raiiior cn iia'lnity of 
rcssi-blG foat-ircs' wMch wa must arbitraidly choose* Shis cheU5£ 
in :j;fo i.y o^oaifci'iny t:,e h; :iL cf fc::t^xo^ in tbo irnv^» vra vlun to 

tbcoe nl'tit ba doiojtod. 
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IrtBRODUCmiUTy OF TBE 

SoveroX losslbilitics eu^east thcaaolroa as suitablo voKlaldesa 
It Is siaiplo to coKputs tbo-noan rsflcctanoe Xn oaoh apootrdl baxid» 
aa'l than to liaa a *ueaa apdotrol ci^'iiaturo* ci caclx fioldy thus 
, reduoins tlio datav to a tots! of four Variabloa par flold* /dtho%H 
. this set la potentially veiy efficient » tfeere is no reason 

. for thimdLatj tliat tlu ac.:n opootral e.1,/jjaaturo for a flold chould be 
' ai^ more euooeseful than the point spectral si^tia?et partiOilarly 
over the largo epatial scales ijnrolvoS* SurpriBingly tliis foaturo oot 
has been tised relatively sttcceesfullyf but the i~or.3ralisat3.oa is 
unnecessarily eztromef and It is unlikely fiiat tlds would provido an 
acceptablo biiels for routine olasslfication* 

A more conpleto feature sot inslules not only the noon refleotanoea 
but ^00 the frequency distribution of ro21ectaneo valuso* Rather than 
attonpt to nodol those froq^ueney distributions, vTith the attendant problons 
of firidin^ a nodsl of sufficient geiierality mid of estimating its 
pariuistcra, it i?a3 decided to osanine the diotributiens directly. For 
eacii field tbs Jiiotogrca of rsfloctaroo values in eaeli of tiio spectral 
bauds was coapiled, and ths data contained in tnoeo hiotogrras used as 
tho foatiaro sot, Tliis was achlevod by splitting the rans^ of rofloctanca 
values into 32 equal classes, and recording the nissbsr of pisolo with 
a valus in eadi claea. The values for all four boi^Q arc put into 
cno vector# 1‘23 r.::nbor3 Icrv, uhlcli bjc-na tJvj fcaturo v'-otcr. This 
prooodaTe h?n certain adv.mt -jg’S, Tirr^tly it c' 'Is vdth 

pols’uodal dlntributicn.3, yocordly, rdiiso it dccls idth I'niroin fields, 

• tlio Vriluas can bo the total fronutmey ci‘ occi’rronco end ronuire no 
fnrthar rrrr/’lia -.tier, hnn tho rc.U’lt t> >'^ Icrgth of ooch 

axiii in. liio fcaturo opaca is dir ::;tiy ralotci to Ib j nac.'.l o-. vsr' ge 
on iho r:i-C'T.l \;V!:hin tb .t r:r o, and Ihi-: tio v.v;V fire 

auto ;nt.LG.'.liT rc- iioi xn th-jL* ( ■' in srtMiCo in of ». 




j. ^ L.i 
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covoro^o* Tlilrclly thia aet atill rot:dra a 1 g .^50 aQount of 
information*, auch it ia oaoy to SJitxalt it to further comproBsioa 
if roquired. For osaaolo, it vould ba easy to ostloato tli3 osan 
and variaxso of oach spsotral bond from tois fcataro sot* Tuua tlio 
foaturo file derived at toia otace acta ao a aecoisdaxy data sourco, so 
that furiiicr foaturoa esa I© crfarac ted without roforanco- to tto 
original CCr data* 

Thia foatura aet resulta in a great ooni:reaqton cf 11^ date* 

For a IGO a 100 field aocs 40,000 ix;alcre are suasEiriaod 123* 

Tills oocaiderabXo ^vlng can be rerJisod wii^i only a irtedest 
esperiditee of coa:,utsr tiias* Furihor, tills set is sjatiil eiisiJgh 
to to ia;;ut Into bo, at ciii^toriBg proc.'rat^ without cx'oatiri/j estsessivo 
denanda on ccuouter time or space* 

Several other foature seta have bsoa used by othsr Invcs titters* 
Varleu'i tranafema have been usod an r.caaurcs of texture, fa? 

CKsiapIo by i^L-vIda (lS73) uho ccEiparod tlie use of the Fo-urior, 

* \ 

Knrhunen-Loova, Vfalsh and Slant ircuusforaa for tliia purposa* 


Ko;?over, KSrvida calculi. ted iiioso trensfoias over snail fiolde, 3x8 
pir.ola in slr.c, and thoir extoiihdcn to lanr^v fields would requiro 
greatly inoroacod cou'-utatioa titNta, end tlie otora£^ of a very laj^;e 
nunbor of transferB GOeffieionts* In a sinilar way Uax'aliclc end hto 
GC-uor!:9rn (ll-.raliGi: and •52iarirati:jaB 1574 ^ita 2 .*allcli:, fhaanaiyaia and 
K,:imtein 1013 ) I'-nvc Lsaaured textee ii3lxrrt!ve spatial "rey-tene 
dopcndcnco na'^rix, doxined over jllolds S4 x 64 pirsla ia t.ir;e* 
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Thors io thus no oin:^o oatisfnctosy sot ot foatures avoilablo 
for doscribiti”; largo fields that can bo iaplcnented in reasonable 
computing tinos* hybrid apixroaoh dlsoussou by Graconopoulos 
(1374) presonts an intorosting altortu'itivo» using loi optical 
. oysten to produce it I'ourior transform of fieldsi nhioh is then ' 
oonrortsd olectro»opttcally into a digital aignal suitable for 
input into a clausifier* 

5) CliOlCB 0 ? CLUSTSa^!© AWJORimi 

A great variety of clustering ctratcglos could I4 impleiffontsd 
for this otudyf and some ohoioo must bs audo betueon them to deolde 
which are boot for tho purposes in hand* Published teehni^uss 
wore used sad nodified only wlisre neoessaiyii so that the perfoiBance 
of ths classifiers oovdd be related to previous otudiost and no . 
attempt was r^de .to add yet another strategy to tlie already long 
list of stratefid-os available* 

TtfO main croups of ciftssiflcatloa procedures can bo identified* 
Firstly* hier.arehical clr..ster analyses* such as have frequently bean 
inplemsated in iiia social and bioloricr.l eciencos* constituta a body 
of standard tooiliniqucs* To theso certain nodificaticiis hsro been 
nado to cater for the special class of spatially organised, data* 

Second, thore is a diffuse body of other strategies, most of wMch 
involvo rather ncro ^ assumptions about tlie data, but ctvn also 

Iranile larger tint a sots. ______ — • 

C'y.zz:2a to :.ll cli’.jtor.ia; tochrj.pv.o:} is that they rcraire sene fi'Qssua’s 
of tho di'^tr'ce het^r’on points, bet'.joen clustsrs, and novoral dintanee 


.'-V' rv/a’-bla. llct'.ao' 


I'st’.'.'on ' cir.tn, and batrrsen r.ointo 


ap’’ c:;"'lr03 t}io tj." ..-\v 2 . 1 idc ".>4 dlrtrxco 

is neat fr:. i.ly vsoJ, alUiouph th ; .altcma'idTS *city*“bioOir* 
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dlstAnoo h^jLO a certain bsonuaa of fpKJator oood of 


coaputatlon. ^.o diotancoo Intx^oou clustora of pclnta is a 


oanaldorably hV::-o ooLn} lt.s problets* ditto It Ir/clvea not onl7 tho 

#• 

abr^ioluto ttir,tr*nco Vot;:ooa nlvstor catroa» but rsleo Cio diopordoa of 
the obsorv?.i;icna orour4 thoso contrca* A rcv^e;; of theoo uttasisred 
lo slvcn by i.a<^ox* & Landsrobo (lS72)» wbo uonclisio tiiat in eeaerd 
there is no oix^cn^ reasjon for choooin^ ony cm in preforenoe to 
, any other* 

5a) ChASSIGAL CLUO^nil AmTSiS 

?ldG body of toohni'iuea naos vnriotiono of tlio alcodtlra 
deacritttd by v’srd (1055)* for i!TOduoii:iff hiorcrchical cloadficationa* 
Thio cjethed cjEiaicoo the obsorvationa in thi varlnblo apaco* and 
findo tho tr.ro obaorvatioaa clocoat to^jothor* Tiaoe t>ro obser\rGticr3 
ore replaced by tlttir cean position* cKd tl:*» whole set of pointo* 
rodtttod by oej in nuaber* io exaniined again to ^nd the nost 
oiaallont dista/tta* ?hi!S at oacJi itoral^oa ths nmbor of pointo la 
diairilshod by one* aad a hierarcMcal classification results* 

This croup of tocliniquoo la ar.-ilrihlo in iiany versions* and can 
bo cocsidoi«d ii 3 titt struidasd claasificatlon laolhod* iiowsvci, they 
require the calculation r.nl continuiid re-<tceos3in.;: of a satris of 


distances bQt;;ooa all points in ths vcriablo space* Thua to classify 
tt points* they tyi.'ionlly r. auli'o the uae of on a by n distaJico aatriR 
vriiicii is 

on -ha i:rab::r of oh-arvatiem tJ^it can to 3iv^Ilod* toot . 
co:,i; utor ryncs-iio can co':3 vrlth only of tiio order of WO paints* 
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and ovon tl>3 wso of biwjk-up otoro^'oa dooo sot poxolt iho ostonslon 
of tlio al^ciitlia far bqjroad ihio Halt (Opoaohav, 1974). Thla 
tooljalqua uau Ijaplettantcd &r the eliissifioattoa of «io 176 fiolUa 
200 by 200 piAf'Ig in ol:so» uols^ tlie prosraa J^ubliBbod by Yoldaun 
(ISGY), TMo ia’psraia hao besa used by Owoa Jor^a (pora.eoiaa.) la 
14io rofflote Boaaiaj field, and ydoldes wbat can bo Oonoldored aa 
otandard rasulta* It could not, bovovor, be extended to copo >dtb 
tho lca?gor mcibop of fiolda 100 by ICO pix&la In cics. 

It Is possible to U30 the spatial chasaeieris tlce of tbo data 
to aCiiiova a reduo tion in storagd roqulresKotOf end thuo bandle a 
largqp data cot. Thua by joining spatially coi^ijpjoua points, 
it is posaiblo to produce areally tiebt elusiaro* This permits 
tlio sioring of only tl^ distanooe batT;cen coatisiieua points* 

Openslijir (ISYV) has used this ts^niivud to prodacs a progriUn that 
can clus ter 12Q0 points in tho sane spaco tho VoldBaa. prosraa duo tors 
200* This aaviag la coanidorable, but the approaoh presents concoptBal 
difficulties* it is by no nsans obvious that should expoot 
Cidl:.ccat S.old3 noeassaxily to belong to tho esao class* furtiiQr 
thoro 2 rs grounds for xiisiiiGg to plt>cs spatially non-centiguous 
areas in ths class. Thus for oxaGclq if tha inns® consisted of 
a lei’cs a^ioultua>ai area crossed by oiw distinctiTa ilocd plain, 
ve would want to produce only two classes, flood plaine and 
agricultural areas, fand nob divids tlie ar-icultaral area into 


clas.;-.--! ce-’rit-A Y-j th? -flc:-! pidn. :,ash/a clas^tfioatica 
parh^ns ba aebipuad by rslarf-ug <^0 contiguity eomt?aiut at sene 
(sahj-etiv^y - de terminal laic, sta-gj ;in ita anaiyaie,i eueh as uaa 
dona ly l-''.^aa-ucna 3 C^ij t'a'ice ( 1574 ). Tloi,* 97 Sr, in vio;f tlds 
difficultr, t:via ' ral-’ere tfga n< 5 t' 
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5b) 

CJP i^io aworrsX cltormtXvda to 1210 hXororcStiooX tod»al(iwo» tuo 
procod\s?oo dovcXc^pcd Sot ?oi3oto conainj ptts?poaoo veto iwootieat«tV» 
■^hoso v^ui•o i*3;a itosa-idyo cXmferinj Dothod Stirdn (X072)t 

oxii tho chain twthod doocrihod by Kagy & ’Solaha (X972)« 


¥hlO towiai^^io ffeqtiiffso 12io^ 6^ luitiaX oot of oXtiotey 

conts?ao, idiiobDay ba dotojKincd i by taking a eanplo 

^ tlia data* ti*is3 obaaw^ aro ISsoa aaais:;^ to^ 

ootttrof aid tbo pixjcodusa roi^otedt 6^^ tliroojjli tho data 

oftd point to t!iO coaroat cl^jotor* iitc 5)* It can bo 

ohouu tJiat tbia iafoosiiiapo coxivos’jjoo to a •stabXo aoXtttXco. wliioJi Iti ; ^ 
infXtonccd only BXifjhtly Ijy tJiO iaitiaX ootiaatoo cf t!io olisstai’ coatiflD* 

This toclud^uo toe boaa iiiplcraontad in tlio XAIWY3 package of pcaoto 
oonsinj progPiVAa X972)* 

initiE^ ^V3to? cej^ Hithin tho 
•dXoivi* of pointfii ^ in tto ntator of 

obscswatiGiis and a tiiO H*3 avbitirariXy fteaicj^iod to 

g ffl ** Pocaupo the Bto^o only tha clt3«<07 

CohttSsSf cjkV coa^ tJjD otocinratioii polnta ca a rvUlniatXo baac 

v.p* st-22c (n2-”ri.*'tic tape os diao unit)* tjun procadupo tfa& ca«ablo 
of hanilih^ lassii ritobo^o of pointo vi‘wi e.aso*__^.. , 

'•roMicas oa-tinny civatosiras ara 

cojitrecc tIv?jjo citsntom o^y ovoslap* *aad it ib thhfi hoeooadJi^ 
to ■•::■. t‘'*3 c'i br!r;-crn p irs o oluntoi-n ‘ to dsti-i-nino 

n:^-7 'arp \ ■ iP*Phlc!pa.. bf- '/ ■ 
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5Vis 


doi'in&HT lnt33>>olU9tar lv:ia td}?or^ l)don cootionoA* 

neat or tl'id titittiUoaol navtstisroa of tho dlol^sioe botisoon 

iovorslOT oC oi^r oia coriiiTO 

OP too different V orianooKsovsrlane^^ Vbs oalcidatioa tiss 

TOquiPod to icr/ert a looat a s (n<*l)/2 s£;.tzicoa idtli ddssneiioao 
IS3 2 l.«^Go;3(3a Tpoblbltivo ar4 also inoPoaooQ tho ppaco zociuii'onents 

eoncddcst^blyt 7Uus oxt tdltopmUvo coacuso vsa doTioodt viilch voa <2u> 
Qitsr bio diat-ROo t!io t;io centi^f, \;edSbted b/ tha otsritGrd 

deviutiona M vdclcibloi OKio# It io ^oa ter 




n 


^ jjT> t S a -.it 

it » 1 r dr (T y 

^ dk * ik' 


Tw ‘*'1 

Khfiiro ia. JI;s' ^aa; 0 oa v Piab3o 1: 


. S.t*> 




o ' la uio akiidij-ssi JcvJUivion th3-i: '; oavVftpiabSa 'fe 

:■■■:. .'-'ip,. feO, .njUfep;;.©? V-; 

’2zli. rij..c.pe on a ;brrle»b^^ if* oji tha 

avor;':';??, : tl'.o .eSaiafeiiya;' :■; 21; has a lazep bciad „V 

0*5 cooa'to 

Vi^eacat' elt2:i3Po: Heuavor, no 

%« a V*.* .4 i c.U.';>-.^j'’.;3 V SI £-..-13 alictit; tliia 


•| I'-Jf' "5 ■'5>.^: ■ ■■• ,■ 'i* ■ ■ ■•l'rt -.-:./.-i, .■ r.y.~: 
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Upin^ tSiio lacpsuffo it 4o popolldd to ooaatruot a •oopv'irabiltty mtrl:: 
ohoiiin^ tho diotrinooo l>3ttrcon oli psi9?a cf clmto^ ob»trod* Vlilp eiatris 
lo oxasrtLiipd laixuually^^ end su'o^ootlvo lutorr^ctatioa ln>/ol7od troa 
aided by t(d:ln:7 icto ^coowit tho oputial djbtFib;7tipn of tbo clooslfiod 
points* 71io:.'o tU*.o a conoidcvablo do^^’oo of rjacdom Xoft In 
Itt'OprirrttiOtt of t!^ CllVil OlaS£)i.fiod mp» 


OJhio laottiod v-na taied by Kacy & ^oloba (lS72)» baoed or en alcjoritlaa 
described by norjioi? (XS;{J4)* TIsio aothed oKtaiiaa cadi point in tvva» 
end oaloudatoa tho diotanco lotwoon it oad all divitero* If t!io point 
ib fibptbor ^ ibx’oehold vrAiw» Idisn 

it f eras R ao’<j ciuotoi*. If it ia thrJtt. t;i;ls tliroshold, it io 

oddod to Cic cltt3-;«x*t and tho clvntor esati^ raoalculafeod* Kar;/ and 
Solatia discuss isiio two ef o •psdoid.ty list* uMcli psraito tiio ac-arciiia,? 
tl)TOucrh tho lijit of clufjtox'o in an effiG?/*iit r;an:io?i and also tbo deletion 
of ciuators iKtich de nut kx’Ot^ and tiiuo i’OiJivaont anoaalowo points* i’Jio/ 
fV4?t;h.>7 eieouao^ w^alr;a{rivtin'; siiilar points into •fiaiu-'j^ 

bofoyo ttey -?>3 coiipaysd to cluotoya* ^?hlo toohrir^no ia not G.p::licablp 
in this casQ» sir^o the data is nlroady In an aiajsjs'iatod foyni^ tlds 
noiiiieation 1X23- ^ A i’vrthur aedifiention tJiat tiaa 

Inplcaoatcd for this anxdysio wria tho rcccsrdias of tho dolotod clustoro^ 
end tJio oubso *.vaat allocatiGn of tho dole tod <^notca?a to tlio raroaisinf; 
cluo'CC?."3 nt i>jj erd of tho m'>lynxn« 6jt ' ■_ 

'fiiio tcoU2i';U'3 than roiulrGS only vi:o onr^o oyd^'v of cpacc* r.o 

tho Itorativo rl'Toritlinj* but bocnt\30 it Onwinos each data point cn!^ 
ci'iCOj it io aovT.r^i t.lr;o3 IViavox' in G;.ccaticn* . , ' ■ 
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Cno jjU’-tijSi' tiial; woo adof->tod I’or liiis iuv*c3tication uoo a 
tooJjni'Av-* tnt olayly isv/oIvc\l d:o visual ituixscUca of ccattor 
iW ,'ov ftjch fiuld the ci'2:ii vslua for each bend ii*aa oalouXatGd 


Olid valuoo for each imir oi‘ haado plotted aa a tuo*»disea3ioail 
ncatwor Cli. .,'^cn* On ■liisao tUaGria;.a» cits tors of i^oiato cm ba 
viouully d./L'ors:lncd, md tlBnna?r<?d* Jhis tMchnif^uo allows tl)0 
rai'ld .'••^'o’r.stlca of a classified riap br.ct.d on a rliiiivA'a naomt cf 
ii'Sor,:iation, which is potontielJy u;icfttl. 


5d) 


'' -r . • V 




To ov ra".rico this ^jT70ati;jaticn has avnilablo fow Oluatoyii:^ 
tcclirJ/noop the hlor;irchioal, clasaical not’.td, tho itcrattvo end 

r-.?t’;cis, rnd 1i:;o vlcaal ?.ir:rcctioa cf cGattor^plcts, „ith tiiis 

/ 

body of wo aro in a rosiUca to ermita ooea rcculta, 

rli’ctf h wev.^r* it is aocossa:.^ to discuss tlic data avails bio* 

6) 


Throo 00 13 of data TJoro avoAlablo for too cmlyoisi two cf 
canti'rl r.’iiroi'rJ.a rrfl ono cf onstoni ih3::laril* {'fablo 2* fics 7tO»9) 


‘T- • p ..T *, ‘--t ♦*» «. •/-»• 
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cf "lo cc;;iv:i ’.’"iloy, c'iar'.ot':ri::c.i by irrirjatod c:;:slcul’Ln:ro* 
ini^oo arc c:‘ oarrc:iti.;\l ^;7 t'o O'L'io area, rrro t-Ixa in .C;,;tonbor * /?» 

1^4. ^ - .p •*• • »k ,p.P . V’ •« •X- . . •*.-» ♦* J %'» ■ ~*v. ..*■,/ i.ty^ t •t V* 

. .. in';o d r!;:# 
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Si'u 


Tho liiidGe of o^aterii En:;!Liii(i laroridcs a confcraatinjy ccana^ 

* ‘ * much Icocr qiiolity. , Tho low rocolution cf tho ecaaovo and 
tho haay ainoapbiro contribato to fn inaco ;S»5ch is vlciially vory 
poor. Howcvor, li-jo ability to uoo all tow rry'^'ctiral bands 
Biuxxltanoouoly remits • bo onalycdo to dif fcrcnti-; to foatvxos vhich 
ara not obvioits oa any of tho olr.jlo band imafog* 


*?• ^R o .^.n lt o., 

Soao roaulta of ciiplyiAT i^ioco tochniiiU'is to iio ioaees txo 
aho;ra as flco 10-14» and tablo 4- acta as a to tiaoco onalyaos. 

Tlicy Kiicrald to coaparoi to tl>o iu-^rps fro:.i they i;ora dorived** 

fica Y-3. ..lthov.~h "Uas plott'jd clacairicatlon? liavo bo_'a (iCiO!a:-a'lcally 
corrected th*cy do net caarlay tlio Im ,C3 diroctly dus to a lac!c of 
correapoadanco batvroca tho CC!To and t.io iDsacory,- Tho imceiy has on 
alon;^*trfiC': dicp?w.ioc-';eat vit’i ti:o renpet to the CCTs of an avorar^ 
of 120 sern Ii;a:3 at tho north rjid about 50 scan lire o -on tho couth. 
Thoro is however no croao-Doaa diaplcacaont. 

Ths first riajor ccuolx'aicn that cm biO draarn fron all tlio 
ainlyncs iu that in ;;:.aricral bctacion 4 o-ii'i o cre-l'o cotild bo vrocbjccd 
by tlio'jo ti'Ohrirrp.o.o* l.liarcvar i osriblo tho cnalyaos havo boon 
proofented with a atordai'd jrxd-or of 5 iiroupa., Tliis otuidardisatlca 
waa less easy to aclvi.oyo v/ith tba itorr.tilvo tocimia.uo vmeU if 
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7a) 


Tto 200 X 200 fields woro clasoJfiod iisittu tho Monarchical 
teohnlquo, vsinff tho Morarchicai teclmiqivoi tnldcff ths data for all 
opsctral b:’.rii 3 » fer each hard ccrayatoly, and ooch combination of bandSi 
The xcsultc uoin^ all tho data, band 4 only, and bands 5 and 7 
tosethar aro choea in fig. 10, The clasoification vaic 3 all tho data 
(i,o, bands 4,5, 6 fi 7) can ba conaidorad tho standard, against vdxioh 
the performance of tiie reduced data cots can bo coaparod, 

Tho clacsificaUca uain:; all tl» data ohous a roisarli^abXo 
opatiol cohesion to tho reoultc, and all tho foattjroa oboorved 
can bo int>-;ria*otC'd eatiofactcrily, v.'orhinj from l.iJ to b.l, tho 
Croupa voro identified as J l) tho hich diCiira hovada, v.ith its 
Cover of bare rode and lii.cjh nountrin vc,'jotutionj 2) a 

Eoiio of heavily foroated oountainai 3) a cone of mixod frpasaland 
and wcoJlanl (C],;:;pei’rel), vhicli aloo rccin'a in tho Coast nrjvdcoj 
4) ao tho '..’cody cosy.onoat of tho frerlcy-i class beconeo -isoro 
ocarco it is rr;piaood by e. anifora Dry Grasoland vluch blankets 
tho lar-docapo in a uiiif era layor i/liich ocenro as a atrip dovm both 
cldoo of t!;e Con;ral Valloy, ard ai£.o in the upper Kcntcrcy valley 
in tliG emtreno v;outh-.;eat of tho inacoj 5) tho renaindor of tho 
ina^o is talx-n up by tho irrir;atod a^ricultuvca..aroi of tho Central 
Valley, It io verth notiiv; tbr.t tho ax'oa of uTiOuXtivatod laid in 
the left Centro oi"’ t!;o acri cultural cons is correctly clr seified 
ao boXwX,_,.'*i.- j to tiio 




si - Vi 
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f.no obvlovjn on t3'40 ts^-x'^or . nro not ;'id:cd 
out by t'n 'ii:d i.'ocivIr'C' fuvifici’ 

vLvb.ji..‘'.va';a cf i'^o.vr'.o toj-i cvt fidi*ly dist:lnc’lit'3 

rc,;-trvc.:ff {.-id fijVwV::X Lr:V>llc;;* to'.jno C:v. lvIco bo dio tiny vTljlicd, d*ioli 
n:>t aot-.;3voci by tiio nrM^Jyr.ca, boc.oi'u/j tJiey cro 

ttyct^Aiv.lly v::v‘j cllffoTOat Dio a(’<j :C{;Ut aczlcultvr^ol'oonon* 
tliuy rrji not o^octE'.illy diotlnct* 

i d :*;•!. l:;rly tin rorjorvoirn th'it arc n vony r^oalrmt foatxtPC 
of I'ho :‘'-:o'jbi!ila ?.r.!,l of tI;o cooat rrjiye am not dotootod, fear altlicar;]! 
vdou.ally •■■I'Cirdnont, thoao rc.aoivoiro arc aroaiXy saall* covor only 
airai ori^cavj of fio:.dc Z:0 r. IlDO in aizo, An*<hor, 

bOi..ijv; in ,':;onor.' il lc:vy nnl t lisig tarir i:‘..Clu.;r.oo cn\ I'.io y-cy ccalo 
ic oft^.n Oj^rcau. ov.:-;: a field a. 

Cl a:;;i’f.catitna uniny Ciija.'^o br iad-o (of xrbJoh fiy 10b ia on c-:;'^;:^^ 
\rore cd; •.idi'.?.;b?.y ic'-’O fr.’nce;n.x'’al, fM' c:i'',y t’lo Cb?.j-:jfna.l» rdi'oat, r,:jl 
a^ib.liviaicna \:cvo cvm Ool;l:altatci, Olio 

cf b..'i:;do 5 t; 7 {j.iij lOo) r-rcduccn x’oaconaiblo nocuIta» 


L<wv 

c •! nil’ ;. r,l livifn 
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rciu30 f'.o air.o tin fori.v.'r.’! cot Vy co:...Ucni’iy only a auboot cf 
b.j)*/; c.xz net libfly to bs vaaoe/rnfalf 

arefl- : *•; •:oo vnlnea .f i'l'i f’eUe fii? '".ch raiy of c cct l 

* rf*. ' ’.i* {. a (j V V *i0 I .» ., \ A 9 .4.1 ^ mC'tva 


C* tJ. Uj 




-I.'... 




5?4- - A 


S /II 


\ 


EOiio, ultiiOU£;U Poroatod osoa of ttio iJicrrn llevada 


ia ecnfi;cr.d '.dtii tivj Gracslf.ad ccii^u 

t 

X:i.j X’C;ralt;:; of r.pplyi::^ 'liWJ ivCraUvo t3cJu"J.r.uo to fiioldo 
or (lifrcPiJX; zlv.O'j ic Dliotni liy i’llt'T 12. .athov^’li rc^dorv'aioaiicn 
200 200 fields trjia not Moitlcal t:» produced ty tlis 


hi 


tear 


’liqucj t’ao ,'O'0e.i?nt *.;ag G^ed. 


Tho colinit'-udsa 


of tlio Or-iscl'iud cii'Jil Ch-parral boitjo on the north o.Tcten. oitld of 


the cdtr;.'.! Vvalloy is slnoyt Iclaiticalt altl:C'Uqh thora is a coafusic;! 
Vott.’ocn, t!',e r.ir;h rlcrr?. and I'lo Pry Craa/rlrndn# Tin n.'jz.’iGUltiT«'’J. 
oroa is diriaod in?;o lnx> BOrwa, ini 1mm* oae coadotir^ of czmllcr 
fiolds (s,l:Cirir.:j rol en 'U-j faloo cola'ar i.*.co i'Uz 7)» mvi tiio o^Sr.yj^ 
(iliQ'shxj tho Of in'li'atica ua.jOi’j ahc.;inj bl'iJO on tin irx-::'-. 

Veins Vio si-uinor ICO :i ICO fioHs (fls 22b) tho ov;ralX .u^.ttOLa 
io “Uio zcx'iOp l.roeci’TTius uiroo-xcid divisicn into a.'pi’lcsIv.Tc# 
^•aoo3.a-.Ciji xcx’uecj Vnt c.io '■^.i’j.CvilvLnrul coji nomi«x.u-«a 

Bcrv^a £U .'0 siw'Ci»vi*.^vwd# It uo oxxo u*-*3 oi s;*axlwr 

flold no'c . redueo n clvjt^^r nr.'- of Ido e c.:o cl'onoo 
hs'ctor ri?s-.ito/2.oa( Vi.t ra'dior a uaiiaLOiiiJ Jlnl Oa cX.tes^awCavXcnc 
Ao the ri:;o oi tire field is tli: liddicl so tin ci-uvlyois begins to ;i(f: 
out irdiviluol con o^nntJ of tI:o ic’-.loonpo. I'hio effect in even 
T.W n .-ini vfnn ccmldoriiv; -n EIO :: 50 flclclj (fls.lOc). On thin 
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7b) 


5?ho ro» 5 ult 3 oH both tlio Idcrarchlo&l and itoraUvo 
toclirinuoa applied, to iuixr^a K aro uiuiU-J? to thor>o obtainod froa 
ijiaco D* Tho oiig na;}or OKConiion io tliut ’Un ^ 

tho adjaoont c-^nfx^lo^a aro not Oifferentiatod, roHootic^ tLo fact 
that in errinS'-’JiSi tiio c^cs^ol^i ia irro’.dns vi:*ovciwly and io tJiuo 
spectrally drilar to tho irricatnd cropa of *Jio valley floor. 

' Tl'ig offset io V!^rticul:a’ly narked in th© in-r.U'od ina<;3 (xa.'j.wb). 

Dhvs in thin c::>ly"i3 tho cooaon:il v a’,1 ition ccnfnsos tho cUoolficavion 
procoduro, douoiiatratinj the noccaa;dty of both uoltiopccirra and 
culidtcnpcral fata for a co:’.?lcto oivalynio. effect Xr. 

viciblo in both the lucrarehioal c.ni itoratavo cv.r;.;xxJ.C-tici.*., ai-i 
tills conaiotort bohaviour in tho face of a data chraije doucr^tratca 

their validity for fee application. 

The rco.-.lt cf applylr:: the chain nl'i>ritha to the ICO s 100 
ficlcLo fc- ina-;e U ia oha:n in fi^ l^e. ^he r :oulta aro noro 
confusocl th.an for sithor of ttio othor cuialy.:03i altnousii ths 
genera lii '.03 of tho tlorocfold divieicn into aa'i'icultnrc, 'xcieolaa 
neuntan io viaUo, Vhie rcoolt aven the cener-d inpre^enioa 
of a noro 'ncioy* clanaficaUou. hie/thor tliie roeat i;ae obtaned 
oay ator a oartan a-oint of nith the tlercahoXd ^td 

olmtc?’ deloivLO’' critoviiif -fhj.oh Irdic tco thav 'l.•.iie tocinn.n..o .fcnld 
not be cat-itio for u eiip:‘"aced rp'-li cation cn a rentine Iraxn. 
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Tho cltanaii’icairloi'is of imco C of caston linslarjd (fig 14) 
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CTO ,'ln i' ^Lcss cXoar, a?id tho tzoai icrsiotont foatui’O of 
too ir^oulta if too oLviouo divinica Itot-.rcoa lani and coa. 'i’ho oiiaplo 
hierarchical tPchnj.ou9 appUod to tho 200 z 200 ficldo (fic\ti*o 1 /!q) 

ohovre thrc.a diatirct .'■roup? in the traterj tho challou water of tin* V,'aoh, 
tho sod ice nt pliue o-a.^natiiv: froa tho huabor, tmd tho clearer ’rator of 
tho J!crth voai ^i/jurG 14h waa proporod uain^ the c'^sal^Or 100 2 100 floldc 
It ia cnnfnsiod on tho west by tho cloud covor ard the ace&.;ir.tcd cloud 
ohndowa, but over tho wot of tho ima^io oevox’al parts of tho land aroa 
aro die ^xu^ulaacd. Tho soa a^iain difiddea into the darker rior© open soa 
and tho naxn'owoi’ coastal water with conoidcrabla ti-rbidity, 7ho fjolda 
which coicprise rou~hly equal portions of lard aid soa are eo diof.Kctiv 9 
tliat thoy fora a ooparato category wMch ia al::ost eonfinod to: tho--’ 
coaat« Two other f-T-oups of areas fall into tho cans cata.'jorv; it 
includns a coiibiiv.tion of cioadj aid cloud sh-adcTja but the other is not 
a otatictical lU'tlf ict oinso it it? located prccirelj' over t’;c corif ircua 
woodlands cf Cannock Cluisa and Glisrwood Feroat, Tho rent of t!in land area 
01 v; dec into two unCs„ual portiojiC; in ronoral tho cj;vJ.lur of thoco is tho 
noot craEie part cf tho itindscate, notably lest I.orfolk and the fens cd/^o 
couth cf Lincoln, but th; rol.?tionc;ii.p botwson tho proportion cf r.rablo 
Imd and t r,.': dlv;'...*.'. c'l xa nut cou'ir. t;.nt acrcas ILiO oni fC.iU feu'thcr 

unknciwn f-’ctor r.uat be involved. 
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o) iu«cut;.'5ioi: 

or tlio c.lr’.PDificatlpn tcchni'-.uou «nc;d on foi’, liy tho fjir bsot ia 
torrja of tho intorprota^dlity of tha rcoiiJta io tJio olcosiicr^l hierarchic rl 
nothoil. Tiilo notlioi ICf hovovor* oovcrcJy llsiitod by its la-^hility to 
yuindXo tiic 100 X 100 pixel fj.cldn, which rcdxico ito utili'^ In dealing 
idth oc"lo lnndC!C"poi:i* twch re tint in tto K'aat of Tlio 

Chain rljoritlaa on tho other honi io hi' fijr tlio neat offlc lent in 


torna of tlio v.of.30 of coineutor iir.c aixl epaco* but produces conaldorably 
lovror rur.jitir roaulto. Th'i iterative tocluiiaue pootao a cooi 
coaproalsOf but it ia hard to 000 how thia could bo inplcsontsd on 
a GpGOi'sl purpouo I’achina for vouti;i.» r.pplicutiono to tho E.'-ViJ.i im';Gr5% 
no\;cvor, all v’.iogo oiviiterii''; tochxiirivou tr>ico conoiderubly loa-:. tino to 
execute than t'.ic fiolti ard fo tiiro conpilution oti;*;o3* 

Tho inabilit;/ of sin:;lo bi^nds to prc-.luco offoctivo claonificationo 
pointu out t!io voluo of ruin,; tho nuitlo n'.itvro of tho d*ita to 

tliO full, £.nd ti:a Do:>>ooaal cfi'ccto noted do/.ono ti’ato tho need for nr.ilti- 
tompor.il dota* -’{owovor, tho full Ifd-varinblo for.turo not hue come 
coa'jidorublo rcduniancy, and oxeoriaontu invoUdn:: ito roduct-lon to a 
64 vr'.riahlo L'jt .rodueed idc;:t.lo:.l c.lao£iI,f;ic:'.lion'3* ’.’'O condivle 

tliat roduotlcne in tho aiue of tho rc-aturo cot aro boot roada by 


rodueirv'j tlvo nvabor of variabloo for fspoctrul harid, and not bj»^ redruoin'^ 
tho r'.v'dx'r of h-'r'i.d''* 

nffoct of vn^/iupT the- I'f-oo of flold w.ocu i- not to v '.y 

the roDO'ution of tho achieved claoru'i'ic'ilien, h:t ciotcr<^lno3 tho tyro 

> 

cf ciaau vuoduoad, th' ch:lco cf an •.-ru'Ovri-.tJ aLr;o detendj ca tin 
scale of •'h'j fC'.tv.rOM in t)i3 l-uidscarvO, and thun tho SCO x SOD 
fic'Jihi '.5tch r!'-.;tt.''nl'J ihv Ca.liJi’C’nh^ i i.U 

caot<san nhrro ti3 a::;lic" f.lcl'C': ivr.) c ^rectlTr. 
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VhctfiO aU b.iori dcv'^lopoA in thr context of a 

Ifirje i^cs^exv-l rtu»;oao coapxxtor. tot il cauputo? tiro to 

oonpil'j the Toatu:.’'? fi?-'-':) rntl tn cirfr^aify ‘liun on l!o >70,'3.C5 
ttt C;xiU'iC £:5 La c2 tiro ord'-JJ 5!0 rrinutoo oouti-.a proocotilita Uio, 
of vlrich. a ?.: rjo t-.novurt ia trJrca ir? viiJi tJio te.ro h:inaiir,c; rmitinoo# 
Obvlourjiy Tcv rouU.ro eippUeatlcro eotro clocor apprcucU to rori tlJ33 
urniima io 3 .-civdrci» trvd hero the way for.?ai'd ecc-r^T to bo vitii cpociri” 
l;ur:'c^.;a ur^nll C'j:::utoro, In 'urlo cor.’j>:::t tlic r.oat ico::lclr..r tccluitn’.o 
oocno to bo tho ck-J.a oicv'ritln, uuch ro-iuiroo littlo otjro r::d 
o:r.-';v:.irva 'll;-: dv.tr only ao it iz rr:vi is;. Vo c:.u Uruv cojr-luie 

tJrvt, at l.-vaa 1; vV, Ml ; ri>oc:it vyivor-at! c:: of ec-vUt<jro, the v livr» of 
d.-ta to bo rrrooorvjd vrill f:rco uo i) uv? toc'u-dn thr.t oro not 
n?cv fjcvrd ly Mio bo it, but uhvch offor ivao licpo of coyia.'* v-lta t-iat 
vaot ',u.o”d:ity of d.v.ti* 
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A COMMON U.K. FORMAT FOR ERTS DIGITAL TAPES? 



A. C. ARMSTRONG *nd I. E. HILL* 

School of Environmental Sciences, University of East Anglia, University Plain, 
Norwich, England, 


The need for a common U.K. reformatting scheme for ERTS digital data is 
presented, and the merits of various schemes discussed. Practical details of the 
reformatting are discussed in detail, and recommendations for a possible scheme 
are made. 



I. INTRODUCTION 

RECENT DISCUSSIONS HAVE suggested that a common U.K. scheme for the 
reformatting of ERTS digital tapes would be appropriate. The following is 
intended as a basis for discussion of this point. 

There are several disadvantages associated with the format used by NASA for 
the system corrected MSS data, as is evident from the number of users who have 
found it convenient to reformat the data before further use. Some of these users 
have taken advantage of such a reformatting step to modify the data, by introduc- 
ing, for example, scale and geometric corrections, which may be useful for 
specific purposes, but have little value in the general case. 

At present there are only a small number of groups in the U.K. who are using 
the ERTS digital data, and thus if a standard reformatting scheme were to be 
established now, it is more likely to be adopted by subsequent investigators. This 
would permit a considerable saving of computing effort, by allowing the easy 
transfer of software between users. If however the wrong decision is made at this 
stage, the penalties are severe, so care must be taken to ensure that the adopted 
scheme does not preclude new uses of the data. 

The most awkward features of the NASA format are: 

(i) The segmentation of scan lines into quartets, each of which is stored 
on a physically separate tape. 

(ii) The interleaving of the spectral data using adjacent pairs of pixels. 

Of the several possible changes which could be made to the data format the 
following seem the most useful: 

(i) The reconstitution of the scan lines. 

(ii) The reformatting of the interleaved spectral data. 

(iii) The repacking and reformatting of the annotation block. 

Each of the possibilities will be discussed in turn. 

The choice of format is however also dependent to a degree on the char- 
acteristics of the magnetic tape systems available, and this is a factor over which 
the potential user usually has little or no control, i.e. he must use such facilities 



* Now at Experimental Cartography Unit, London. 
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A. C. Armittontand’U Hill 

u are available to him. , . . 

Using 9-track tapes at 1 600 b.p.i. it is possible to store a “mpletc taage as 
4 sequential files, identical to the original NASA tapes, on a single 2400 “•taP'-. 
and this saving in spa< i is obviously an advantage. However storing all the data as 
separate files on one tape means that if an area covered by two or Ouarf®*" 
strips needs to be examined, accessing that data becomes difficult. Further, w 
availability of 1600 b.p.i. tape drives is limited, and it seems advisable to defme 
a scheme that will fit any system, and which permits the storing of an image on 
more than one tape. 

2. THE RECONSTITUTION OF SCAN UNES 

The reconstitution of individual scan lines has been the most commonly imple- 
mented reformatting scheme, as it permits the examination of any small area of 
an image without accessing several tapes. It further makes th processmg of an 
entire image as a unit very much easier, since most image processmg systems 
require an unsegmented raster scan. Ustly such a reformatting 
to the form in which it is collected, which seems a desirable feature. However, 

such a scheme raises certain difficulties. 

Such a scheme requires a computing power that may well be beyond that 
avaiUble to many U.K. users. To compile scan lines from four Mparate source 
tapes would require either a multi-pass system with tluee tape drives, or else a 
siMle-pass system with five simultaneous tape drives. The reformatting can abo 
be done using a single tape drive, and holding the four t.apc files on 
disc store. This requires more disc space than is usually available to 
although the Image Analysis Group at Harwell have indicated that they can 
perform such an operation on tl eir installation (Carter, 1974, personal comm- 

**"'AnTuernative way of implementing the reformatting scheme would be to 
create a small central special purpose computing facility, dedicated to performing 
this task. If no further reformatting were required, this systern could 
mented with very Uttle direct access core, but with either 
a combination of a single tape drive and a disc pack, i essary to do the task. It 
would be possible to build such a system at an estimated cost of th® 

£30 000. Such a facility would, however, only be justified if there were a greater 

use made of ERTS digital data in U.K. than is now the case. nf 

If the scan lines arc reconstituted, a second proble vi arises as to ^ 

those reconstituted lines. At present, the quarter scan Ur.es require 3296 »-bit 
characters to be stored. If the reconstituted scan Unes were to occupy one record 
they would then be 1 3584 characters long, and would require large buffers whi.h 
might be a problem where only small computers arc available. 

It is suggested that the reconstitution of the scan lines be achieved by wntmg 
the existing four data records sequentially onto the tape, instead of writing on 
long record. The reconstituted image could then be stored on as many tapes as 
the user requires, by splitting the image at given scan line intervals. 


3. REFORMATTING THE SPECTRAL INFORMATION 

Within each scan line, the data at present are stored in 8 character groups, with 
the spectral information for two pixels interleaved in the order: 


(4, 4, 5. 5), (6, 6, 7, 7) 

thus making it necessary to read 8 characters in order to gain the spectral informa- 
tion relevant to one pLxel. , , 

Two reformatting schemes seem possible, either the data can be reformatted 
pixel by pixel, or band by band. In the first of these, all the information relevant 
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to one pixel is stored as successive characters in the order: 


( 4 . 5 , 6 , 7 ) 

I 

and in the second case the four records for each scan line would consist of the 
complete scan line for one spectral band. 

For most digital applications, multispectral data are required, and a pixel by 
pixel repacking seems appropriate. A band by band repacking scheme would 
however be more suitable for pictorial processing, but in view of the low cost of 
the NASA photographic products, this advantage does not outweigh the increased 
difficulty of digital analysis. 

A third alternative that has been adopted by some workers is to produce tapes 
which cover the image but contain only the data for a single spectral band. Such 
a procedure requires the compilation of a reformatted tape from four separate 
source tapes, and the problems already noted with reconstituting the scan lines 
equally apply. In addition such a scheme effectively precludes multispectral 
aiulysis of the data. 


" i 





« 


4 . ANNOTATION AND IDENTIFICATION RECORDS 

The repacking and reformatting of the heading blocks on the tapes would result 
in some increased efficiency in the reading and processing of the tapes. However, 
these records form such a small part of the total data set that it is felt they are 
best left alone. 

Once scan lines are reconstituted it is necessary to reproduce the annotation 
block only at the head of each tape used, and to include such information as the 
reformatting makes necessary, for example the number of the first scan line on 
the tape. 

Further if the heading blocks were to be reformatted, it would probably be 
advantageous to add to it data not currently present in the header. Data concern- 
ing satellite ephemcris is available from NASA, and it wc'Id be useful in some 
circumstances to have this data available. Thus if this data could be made available 
it would be as well to add it to the header block at this stage. Probably the 
easiest way to achieve this is to add a further header record at the beginning of 
each tape, into which the user can place such information as he cares to add. 

This block can, of course, be left empty if the user so requires. 


5. DISCUSSION 

The major advantage of reformatting the data seems to lie in the reconstitution 
of the scan lines. It is felt that unless this can be achieved the best course is to 
leave the data as it is supplied by NASA. However, this reformatting can be 
achieved, and it seems reasonable to put forward the following scheme for the 
consideration of the remote sensing community in the U.K. 

1. The annotation and header blocks be left as they are. 

2. An additional record block be added after the annotation block to contain 
such additional information as the user requires. 

3. The scan lines he reconstituted, but left as quarter lengih blocks. 

4. The data within each record be reordered so the information for each 
pixel appear as successive characters on the tape, 

5. if demand warrants, a small, central, dedicated computer be set up to 
perform this reformatting. 
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METHODOLOGICAL QUESTIONS IN THE DIGITISED 
ANALYSIS OF ERTS DATA 

A. C. ARMSTRONG . ^ 

School of Environmental Sciences, University of East Anglia, Norwich, hngland. 


Methodological questions that arise when using ER IS digital data to derive 
'land types’ over brge areas are discussed. Some of the disadvantages of per-point 
classifications as usually implemented are outlined, and the advantages of per- 
field classifications discussed. The superior performance of an unsupervised 
classification scheme in an exploratory situaiion is outlined. The advantages of 
an unsupervised per-ficld analysis arc, however, gained only at the expense of 
greatly increased computing effort. 


1. INTRODUCTION 

IT IS THE purpose of this paper to examine some of the methodological issues 
that arise in the digital analysis of ERTS data for regional studies. This paper thus 
acts as a methodological comp.mion to the results presented bv Armstrong and 
Clayton 1 1 ) and the technical discussion of Arm.strong (inss, i97S). Although 
developed In the context of ERTS-i * data, ihc discussion is applicable to the 
analysis of any remotely sensed multispcclra! data. 

Two major points will be examined, the advantages of analysing groups of 
pixels rather than individual pixels, and the choice between supervised and un- 
supervised analyses, However, in order to give some backgrciind to the paper, the 
ideas behind the invest ieat ion are first outlined, 

The potential of ER i S to cover the Earth every 18 days has two main implica- 
tions. First'y, the sheer volume of data introduces a need for automated data 
processing [2] , and secoiudy. for the fir.l liiite, a consistent data base is available 
for global scale investigations, offering the possibility of mapping features over 
wide areas in a way that was not previously possible [31. 

With these considerations in mind we tiiive been concerned with developing 
a scheme that has the potential for ciassifyieg areas covered by several ERTS 
frames. Tlie classifications would be of tlie kind to permit comparisons with 
those shown, for example, in regional geographies and atlas n ps, It is not known 
a priori e.xactly what types of feature can he distinguished i.' this way, and we 
are thus in a learning situation where we hope to pr .'ducc classifications and then 
to a posteriori identify the classes. 

A problem that arises in identifying the cl.isses is that the landscape is an 
integrated system, and tlsal when one eiivnonmcntiil variahie clianges, then others 
change in sympathy. It thus becomes difficuU lo know whether we are mapping 
soils, geology, topography, or vegetation, since llicso variables arc so inteirclatcd. 
We thus cannot state exactly which variables we are mapping, and call the 
resultant classes ‘land types’ without attempting at this stage, lo further qualify 
them. 


* Now called LANDSAT-I. 
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Methodological Questions in the Digitised Analysis of ERTS Data 

2, PER-POINT CLASSIFICATIONS 

Althoufili nci-point classification has several tiii-advanlai’cs. it is the •erhnique 
that has been invcstiKalcd tlic most Ihoroiii-nly, so that it now a lelalivcly 
routine prcccdure, support eel by several well-known sol (ware packages, and even 

some special purpose hardware, j * i •, 

The main feature of per-poinl classification is the tremendous detail il ofltrs. 
When remote sensing data were collected only by special t lights of special 
purpose aircraft, and was thus available only on a limited basis, d seemed better 
to analyse the data in the most detailed manner possible. However, wita the 
advent of satellite sensing systems, an almost overwhelmmi: How of 
changed the situation, so that the choice is irffered of low icsolution broad scale 
studies, or high resolution detailed study, i .adi FRTS frame gontams approx- 
imately 7 inimon pixels, each referring to a ground rcsolul nm ^ 

79 m. This level of detail means that it is possible to classify only a sniall ..rca it 
we wish to present the results as a map with single pixel resolution. V/e note, 
however, that it is possible to classify any arbitrary large mca to jiroduce siiium- 

**^\^*secmurmajor feature of the pixcl-by-pixol cl assificalimi is that because it^ 
considers only individual points, it is very sensitive to noi-.e m the system tr m.cr 
normal circumstances the noise level in the IRIS data is low, but in some ta.,cs 
it can become quite large, For example, we h.ive been cxnmmmg the imagco ot 
the Wash area, in an alteiurt lo define are, is of scdmient-l.uhn watei . hoi ims 
purpose, tlic data in band five contains the most useful mioi mation. and . lu. c 
fdund that the spectral levels associated with turbid and non-inrbid water diikr 
only by about two gray levels, which is only just greater Hian Ihe noise m ioduccd 
by the variability of Hie six sensors used. Ihe iLSullanl classihed output thus 
shows a pronounced six-line streaking. Thus in this contc.xl at hast a pei -pom 
analysis can be very sensitive lo noise in the system, fhe siUialion is i V 
worse where examining data that is inlurenlly more noisy, as for examp.e the 

S-192 on board SKYLAB. , . , t- 

A further disadvantage of the pixel by pixel approach is the very . 

amount of information available to the dassilier, whicii m Ihe case of 1 R lb is 
the values of the gray scale in the four spectral hands. Any cbssificatum scheme 
can thus only use the Spectral signature’ of the pixel, which can only be ated 
to other ‘spectral signatures’, as a basis for classitieation. Because of the dill- ^ 
iculHcs associated with building up an acceptable libraiy ol spectral aigna urcs 
that has any general validity, those pLxel-by-pixcl approaches usually require the 

input of further ground truth for the image in question, 

By examining each pixel in turn, without rclerence to other data points, the 
classifier ignores one of the most important features of Hie data set that d is 
spatially organised. This spaliat organisation of Hie data is important m the w. y 
the eye recoimises patterns. For instance, when wc recognise an a^icuhma an a 
in an ERTS scene of Hie US.a , it is by Hie airaiigement ol Hie Helds, am. not l y 
the spectral characteristics of the crops (which we probably cannot identity n 
any case), 

3. THE PER-FIELD APPROACH 

Because the per-point analysis of remotely sensed data ignores the spatial aspects 
of the data, a pcr-field approach offers many advantages. Not only docs it cor- 
respond more nearly lo the v.ay the ece observes an im.nu. 1 u*. it pcrimis a lar 
wider iiitu'c o! -uialyscs to l>e implemented. However, ttic apjeoach is expensive 
in terms o'f computer lime and core requirements, and is less easy (in gencial) lo 

implement on snecial purpose hardware. 

The cs-enti-.l Hatme cl the per-ticU apmo-ich is that ii com nlcrs not I'.isi 
individual pixels, but neigiibourhoods ol pixels. Usually the neighbourhood is 
defined -.a matiu o. i.,- . !• n ,x m m - iac, >• "tied ou'i the g- int m queirm, 
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It is worlli noliii 'h i’ il ii ii'it n ’ O" i';- lo !hmv' ;i"y ('f iin' rc' )lniii)n of thi? 
syslL'm by iisiii.4 such a jiroccilui c, lor \vc can locale one «)1 Ihcsc ncitihboiirhoods 
over each ptxe' m 'i.rn. This is ev-.vr.ti iliy the operation perfonne I hy Ihe local 
gradient operator, used to locate buunJaiies in the data. Jloweser, in many cases 
it is useful to u;:e non-overlapping neighbourhoods, thereby achieving a generalisa- 
tion of the data. 

Once tlie peiTicld approach has been ailoptcd, several furtber adcanuges 
accrue. Firstly, it is possible to derive many more I'eatures for elassific.ition than 
was aviiilal'le lo the per-point classifier. Indeed, one of llie inaior problems of 
pel lield classifications is tlie etnbarr.issjiigly targe let of potential leatures. 

One of the most important foauires that can be derived from lieUis, is 
information concerning image texture, Texture is a propcity of an image that 
presents a great many problems, for several reasons. Perhaps the most important 
is that wltereas the eye integrates data over an area to see ‘texture’, there is no 
adequatj model of what the cye/braiii system does in this operation, and thus no 
adequate stati.stical model of texture. Various attempts to measure texture in the 
literature include the use of various iran.sforms, sucli as tlie Karhuncn-l.oeve 
{principal components), Fourier, Walsh and Slant transforms (4| ; measures based 
on transition matrices for gray scale values at s]ieeified spatial relatiotjs ( 5 1 ; 
optieal-aiitilog techniques (6, 7) tiind the application of two-dimensional lime 
series techniques (f^| ;as well tis a large number of simple measures I'ased on gray 
scale distributions, gradient distributions etc. (9i . All these metliods, however, 
use only statistical models, and do not attempt to model texture a ptiuri and 
then calibrate the motlel. Some of the difference beteveen workers s probably 
due to the fact that the eye secs many different kinds ol things that arc put 
utuicr tJie blanket term ‘texture', and the different investigators are analysing 
different aspects of thi ; jiroblem. However, it is the experience of all the invest- 
igators, tliat the use of texture, however it is measured, yields satisfactory class- 
ifications. 

A second possibility that arises from the use of pcr-field classification schemes 
is that it is possible lo alter the scale of the investigation simply by changing the 
field size. We have found that depending on the kind of feature tliat is of interest 
a pai licular scale of analysis is appropriate, and Ihe fieki si/e ci’ii be adjusted 
accordingly. It is jmssible to envisage a hierarchy of classification scales, the 
lowest case being lhat of the degenerate neighbourhood eonlainiug only one 
pixel, and the results of one level of the hierarchy being input into tile next. For 
example, some woikers have discussed the possibility of using pei -point efussifiers 
as inputs to a per-field classifier for the identification of urban areas. 

Tlie list of further possibilities is potentially endlc.ss. Certainly the evidence 
points to the fact I Imt moving from a per-point to a pcr-fietd sciieine lias the two 
major advantages tliat il is possible lo use many more aspects of the ilata, and to 
examine the data at any scale required (subject, of course, to the resolution of 
the syslem). Ho .vever, to pay for litis greater flexi bilily per-field elas.siitets are 
usually expensive iti terms of computer time, especially in the time required to 
find the infornuition relevant to any one field. I’requcntly it is necessary to arrive 
at a per-field d ass'lii:ation via several stages of computation, it is dilficiilt to see 
liow, at the jiresent state of the art, a pcr-field classifier could be built to handle 
file flow ''f data fnnn a system siicli as FRT.S in anyihini; approaching r'.’al time. 


4. SUPERVISED VERSUS UNSIJPERVJSEO CLASSIFICATIONS 

The second major area of methodolonieal debate in liie analysis of F.RTS data is 
between the supeivisei.l and unsii|iervised classificaiion techniques. The popularity 
of Uic supervised cl.issdier ari.ses J.iriiely for histoiical reasons, in that these tech- 
niques were develope.l largely in llic USA for the classification of crojis, in care- 
fully contiolled coiidiiioMS, for which large ainounis ot 'ground trulii' was avail- 
able. At an early siage m the development of Reinoie Sen.sing tecimiqiies, it was 
useful and easy to examine conditions on the ground and relate these conditions 
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(o the outputs of the sensors, am! from this line of approach the irlea of spectral 
signatures arose. However, the difficulties associated with biiiMiH!’ a library of 
.spectral sig.natme,'. has lead to a '.ituation where an input of ground truth Is 
reepiired for each image analysed. 'Duis the traditional supervised elassil real ion 
scheme requires, the delimitation of 'training .sets' which are used to tcaeh the 
classifier. Despite the objections that will be raised, this scheme is known to 
work efficicnlly where sullicicnt ground truth is available. 

Several workers, among them Nagy rf r?/ ( 197 1 ) [ 10) , have poiirtcd out 
several quite fundamental objections to supervised analysis tcchniqnc as it is 
iLsually applied. 

Firstly, the training set is usnally gathered without any rek-rence to the 
assuniptiorrs of sampling theory, Olten the training set is derived trorn areas 
which are thought to be paiticrrlarly well defitred, arrd all boundary clfects are 
ignored, Basu & Odell ( DJVd) [ill have shown th.tl the lack of independence 
between sample metnber.s used to ‘train’ the classifier, can seriously affect the 
pcrfomtance of the classifier. Similar cotrsideiaiions also apply to the testing 
set’ used to evaluate the classifiers performance, 

Secondly, it is very much rtrore efficient to ckissity the data, arrd then a pcsic- 
rion Xo identify tlie classc.s produced, Kagy & Tol.iba (19 72) [12) have shown 
thai in this way it is possible In ciassily large areas witli coirsiUerably less input 
of ground truth than the traditiorral supervised analysis. 

Thirdly, am! most importantly in the context of FRTS, the supervised 
approach requires a liigh degree ol prior knowledge of lire situatiorr being sensed. 
The Bayesian elassrfier ■nornially used requires an estimate of the mean and 
covariance rnatriecs of every type to be mapped, and lltesc catr otrly be estiirrated 
from the training .seis. There is tltur. no provision lor starting a new class ol 
object, no rrratter how strong its signal, and the onN way the el.rssilier can cope 
with it i.s either to force it into one of the existing cla.sses, or put it into a 'don’t 
krrow’ class. 

'I Iris last characteristic of the supervised apirroach retrders it highly unsatisfac- 
tory for many applications. One of the values ol data such as h K1 S is that it is 
available for areas where ground truth is notably lacking, and expensive to 
collect. It is llnis desirable to develop techniques that are applicable in such 
situations, and unsupervised teeliniques seem to ofler the best possibilities. 

The use of nnsnpcivised lechniques does not. moreover antomaikaliy lead to 
the rejection of all prior knowledge of the situation, rather to its use in a rather 
less direct fa.shion. When eml arking on a classification exercise, a great many 
decisions have to be made, often quite arbitrary, ssieb as what field si/e to use, 
what features to extract, the thresholds to be used by whatever cki'isiiier to be 
used, etc. and these decisions are influenced by sueli prior knowledge as is avail- 
able of the scene being analysed, and the purpose oi the investigation. Fur ther, 
where wc have no 'testing sets’ to cv.iluatc the ckrssitiers perforiname, tjiis prior 
information will be used to assess the progress ol the eld.ssifier. 


5. DISCUSSION 

This paper has presented the two fundamental dichotomies in remote sensing 
data analysis, between per-ficld and per-point analyses, and between supervised 
and unsupervised classifiers. Whereas the per-point. supervised analysis has been 
developed to a poim where its :t)ipheaiion tiv well dcimed proldems N virtually 
routine, the per-ficld analysis olfers immensely more possibilities in terms of 
scale of result and features av.iil.ible. and the unsupervised teehimiues oiler 
ailvanl;i;'.c:i in apjrlieation to ii.-w ■ it'ial ions. 

However, the greater ilexibiliiy and usefulness ol Hie unsupervised p"r-tield 
classifier is gained at the expense o]‘ mer cased compiUiTig time, so rr.ueh so ilral 
U is difficult to s"C how srie'i ; '' O ot t ..'L'liiiiques eoukl ''vci be apt-b. d on , 
routine basis to a continuous d.iU site. tin such as ili.it potentraliy avar.ah.e truiii 
FRTS, N'cvei I’teh-.-, the gie.al siu i e-si.'s of the per-point supeivised analysis must 
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not be Allowed to dominate remote sensing data analysis to llic extent that the 
altci natives aie overiuuked. 
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The relative performance of eome tuisupcrviscd clustering 
techniques for the per-Field classification of LAh"*)SAT 

data 


by A. C. ARMSTRONG 


ABSTOACT: 

Three algoritlims for the per-field analysis of LAKDSAT 
data are described* Methods of generating per-field data sets 
are briefly outlined, and the performance of the three algorithms 
discussed* At any given spatial scale, the traditional hierarchical 
algorithm produces better classifications than a multi-pass 
iterative technique, which is in turn better than a single ' 
pass chain algorithm* However, the computation requirements 
of these algorithms is such that for large data sets, the 
hierarchicil technique is too expensive, and the iterative 
technique always more expensive than the chain* 


St- s.<\ 


The value of an unsupervised approach to the large scale 
generalisation of MTS data has already been demonstrated by 
Armstrong & Clayton (1) • In this paper the comparative 
performance of tliree algorithms for these operations are 
discussed* As in most eorsiputlng applications/ there Is a trade- 
off between the computing time required to achieve the results/ 
and the quality of those results* However in this case the 
problem is confounded with that of the scale of the investigation 
which affects the suitability of the algorithms* 


The Algorithms * 

The three algorithms discussed here ore by no means on 
exhaustive sample of the many clustering teclmiqucs available, 
but represent three basic types of approach* These algorithms 
are: 

(1) the classical hierarchical technique for maximising 
an objective fxmetion/ derived initially by Ward (2)* 

(ii) An iterative technique described by Swain (3) 
which majee^i an initial guess at the results, and then improves 
upon that initial estimate* 
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(ill) the chain technique described by Donner (4) and 

implcMcnted In the remote sensing field by Nagy ct al (5)« 

These three tecluiiquec represent three levels of data 

manipulation: the first technique tabes all the data and from It 

produces a derived data set, the interdistance matrix, on which 

it performs all subsequent calculations; the second examines 

each data point in turn, and does so several times; the third 

performs a single assignment operation to each point in turn, 

thus scanning the data file only once* 

Although it is not necessary to describe these algorithms 

in detail, since they are well described elsewhere, some of the 

problems that arise when implementing these techniques for remote 

sensing applications v.’lll be discussed* 

•• 

The nierarc!iical Tochnariuo . 

This algorithm takes the data, and from it calculates a matrix 
of distances between each point* All subsequent steps in the 
algorithm arc based on this inter distance matrix, and thus this 
initial stage is critical to the results obtained* In some 
applications it is conventional to normalise the measurement 
vectors before this step, so that all variables are given equal 
ir;pO):tance in the ar.Mycic* This procedure has the effect of 
reducing variables collected on differing measuring schemes to a 
corunon standard, but might equally swAi.ip out any signal present 
■..‘ith r.oijC ^ or equally enhance subtle patterns that night be 
otherwise overlooked. It is thus necessary to have some p-iori 

4 

knov;lcdi,e of the characteristics of the data before this step 
can bo taken. Ci.a \/ay of onsuriijg that the input data has ivell 
defined characteristics is to perform a principal conpouents 
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tjansformatloji of thn <‘uiia before* this step in the analysis. 
Althoush such a transfocued data set v/ould enable the use of an 
optir.»ally efficient subset of the data, it requires large anounts 
of coiuputing time to be implemented, and it is not suited to remote 
sensing n, .plications which normally involve large data sets. 

A second consideration at the stage of calculating the 
intcrdistancc matrix is the choice of an appropriate distance 
measure. Usually Euclidean distance is taken without much further 
consideration, ito-rever other schemes can seem equally attractive 
for specific applications. City-block distance has often been 
used as a cheaper alternative to Euclidean distance, but in view 
of the critical nature of this step, and its relatively small 
contribution to the total computing effort, the o;,ving is not 
sufficiently great to argue for its use unless a „pr.io£,^ reasons 


indicate that it is appropriate. 

Once the critical step cf defining the interdistance 

measure has been calculated, the algorithm follows a standard 
procedure, of scanning the intardisvance matrix for the two 
points closest together, then replacing those two points by their 
weighted mean. Thus to cluster n points this technique requires 
that a matrix of intcrdistanccs, which is (n-l)X(n-2)’ in S3ze^ 
be stored, and scanned (n-1) times. There are thus very real 
limits on the size of data set that can be handled by this 
algorithm. Most cor.iputer systems can hold up to 2C0 points, 
but beyond this limit resources tend to become strained. Openshaw 
(6) has sho-.m that if, for a. spatially organised data set, wc 
ijuponc the constraint that only contiguous < ointc can be joined, 
it becomes unnecessary to store ..ost of the interdistanoc 
matrix, and that a considerable saving can tl'.en be achieved, 
such that up to 1200 points can be easily handled. Jlowsver, impos 
such r constraiJt vrun fhedata requires that wc a priori know adjar. 
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points to be similar, and' also raiser, problems when non conticuous 
areas are in fact similar* 

The hierarchical technique v;as implemented for this study 
using the FORThAiN program published by Veldr.ian <7) and the data 
were all treated in an imnornialised fashion, using the euclidean 
definition of distance throughout. 

(ii) The iterative technique 

’ The iterative technique discussed by Swain (3) has the 


advantage thut it dees not require large amounts cf cenputer store, 
since the data are alv;ays accessed sequentially, and can thus be 
stored on a temporary device ouch as magnetic tape or disc. This 
algorithm taken some initial estimates of the cluster centers to 
be derived, the data are then assigned in turn to the nearest of 
these cluster centers. Each cluster is then replaced by the mean 
of the data points assigned to it, and the process repeated until 
a stable assignment of points achieved. This technique however 

docs have several disadvantages; 

(i) an 3 priori estimate o- the number of clusters to 
be erctractod is required. Often the investigator has no idea 
of this number, and so usually a value in excess of that 
required is, given. Usually it requires several attempts to find 
a useful classification. 

(ii) initial estirates of the cluster centers must be 
made. Unless some prior information is available, these must 
be determined arbitrarily, i.’c have found that these arc best 
determined by nrsigring some cf the data points, evenly spaced 
through the data set, to these cluster centers. The use of rny 

r'cheme urnrUy resTilts in none cf the initial 
centers being outside the cloud of d:'te points, and rc-oining 
c:'.pty threugheut, so that tl;cy involve ed.diticnal co.nv.itation -.'ith 

jiQ ii'lect cn the results. 
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(lil) the technique tends to use large amounts of computer 
time. To classify n points into m clusters requires the (n x m) 
distances be classified every iteration* It was found that in most 
circumstances the time taken to perform the analysis did not greatly 
depend on the number of cluster centers, but on the number of data 
points considered, and that a limit of 1000 points was all that could 
be routinely used* However, some results are presented here for a 
larger data set which involved much larger amounts of time than were 
routinely available* It is however possible to reduce the time 
requirements of this technique by stopping the procedure before 
convergence* Examining the reclassifications that occur at each 
iteration wc observed that the matrix of transitions soon becomes very 
nearly empty, and that all the reclassifications are bc.-ween two or 
three clusters* 'Ve also observe that these clusters are very close 
together in the feature space. Thus we find that the algorithm 
converges to a stable eolation for a large proportion of the data set 
(over 95 %) within 5 or 6 iterations, and that it takes another 20 or 
so iterations to resolve the residual uncertainties* If vjc accept 
this small level of uncertainty in the results, wc can achieve a 
solution that will probably be as good for our purposes ’./ithin a much 
shorter time* 

(iv) there is no guarantee that the resultant clusters 
do not overlap in the measurement space. It thus becomes necessary 
to examine the •separability* of the derived clusters, so that all 
clusters that are not truly separable can he a:,ialganated. In the 
original schcriO discussed by S’-ain this separability v/as deter. lined 


by tl'.c S..-ai;i-ru di.'vlenco* Ilovrever l!u.s CLrt::.'zz mcas'.’re, and 
indeed all othoc ^ensures ccrtionly used, require that r.t lC7.st 




one variMice-covariance matrix be Inverted for each distance 


measure calculated, Siuce this technique was intended for use v«'.th 
data sots measured on many variables (up to 128), it becoraes impractical 
to store many of these large matrices, and exceedingly expensive in 
computing terms to invert them. For this investigation an alternative 
distance measure was devised, which had the form; 






t . i"./ 

g a; 
o 

or h; 

j-x. 


where P. . is the distance between clusters i and j 

X is the mor.n of cluster i on the Ic-th variable 
ik 


ik 


is the standard deviation of the k-th variable in 


cluster i. 

This distance measure has no distribution theory attached to it, and 
so it remains essentially a descriptive, and not an inferential 
statistic. It has a lower bound of zero, but no upper bouiid. Values 
of less than 1.0 indicate that on average two clusters overlap to soir.c 
extent, and values of 0.5 or less indicate considerable overlaps. 

In practice, the relative values of this statistic was found to be the 
most valuable, and the moan of its value gave an overall indication of 
the clarity of the classification that coiild be derived from the data 

set in question. 

(iii) The Chain Alnorithn 

This algorithm was first described by Conner, (4) and includes 
some of the modifications discussed by Nagy and Tolaba (3) for use 
in the remote sensing context. This technique involves scanning the 
data in sequence, and for each point calculating the distance bctv;ccn 
that point and a list of cluster means, if the point is nearer to the 






nearest clliiter than a given tlireshold value then it is joined to that 
duster, but jr not, it is used to initiate a new cluster. The only 
a priori in/orfflation that the algoritlm requires is the value of this 
threshold, which has the dimensions of distance in the measurement 
space* It Mas found that in practice the value of this thrcsliold could 
only bo determined by trial and error, and further that the performaiice 
of the .classifier was sensitive to this value over quite a narrow range 
which could only be determined by exajiiining the effects on the analysis 
of varying the threshold* The sensitivity uf the results to this 
parameter which can not be determined a priori v/as the major disadvantage 
of this technique, although this was the sole problem of implementation. 

Thus, this technique starts with an empty list of clusters, 
which is gradually filled as the points are e:rajiiined in turn. As the 
progreiruae proceeds, the number of' distances calculated for each point 
increases as the number of cluster centers increases* However, some 
of the new clusters that are formed are only aberrant single points, 
and to prevent the programiae maintaining these aberrant points as 
Cluster means, after specific numbers of points have been examined all 
cluster means are examined, and those including only a specified small 
portion of the data, or which have not been used v.'ithin a certain period 
of tine, are deleted from the cluster list. Although this procedure 
involves the estimation of two further parameters, for which we have 
no ^ priori guides, the performance of the classifier is not sensitive 
to their values and the ovcraJ.1 efficiency of the progra:.ine is much 
improved by the inclusion of this deletion routine* 

The Df.ta 


Although many investigators have clar-rified }'J'.TS data 
point I'-/ .'.-r M;. boon ; a U t's'i'ircls 

f4 genorallg accep'. c ochpiio liv't operates on g.r'.u!p:- of pi;vOlr, 




or ’fields** In order to anltieve j,cn 2 rr.lis<’.tion over large arcas/ 
it is efficient to amalgaiiate the data into fields before class- 
ification is attempted. Once ouch fields have been defined it is 
also jiecessary to extract features fron those fields so that 

classification can be achieved. 

Field definition involves defining the arrays of pixels 
that are to be the basic units to be classified. Convenience 
dictates that regular arrays of pixels constitute the fields to be used, 
in this sttidy three basic sizes of field were used, all being square 
arrays of pixels vfhich refer to para! lelogr arc shaped areas on the 
ground, covering 50 x 50, 100 x 100, and .rjO x 200 pixels respectively 
(Table !)• It was possible to use this hierarchy of fields to 
exaraine the effect of varying field siaes on the performance of the 

classifiers. 

There are feu a priori guides as to what might be an 
effective set of features that might be put into the classifier. 
Experience to date with L/U^DSAT imagery has shovm that for most 
applications the use of data from all four spectral bands is better 
than analyses of single bands. For this study the information 
contained in the gray scale histograi.js for each field was used as 
the feature file for input into the classifier. For each field the 
four spectral bands were each split into 32 equal classes and the 
number of pixels with values in each of the total of 12S classes 
recorded. This vector of 128 nui.bcrs was then used as the feature 


vector for each field* 
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One ndvttntago of this featuce scheme is that it Is 
possible to amalgancite the data for groups of fields pinply 
by adding the feature vectors* It v/as thus possible to 
perforin analyses at the tlirec scales of field from the one 
data set/, that using the smallest fields/ thereby enabling a 
considerable efficiency to be implemented. Such an 
efficiency would not be achieved by some alternative schemes/ 
such as the use of various transforms to measure tejrturc/ 
which would require rc-computing for every field size used. 

" This feature set has the further advantage that/ because 
it represents the frequency distribution directly/ it is not 
in any v/ay influenced by problems that arise in the processing 
of polymodal distributions/ as are measures based on moments 
of the distributions. Further/ for most purposes the moments 
of the distributions could, if required, be estimated frOm 
this data, and thus the feature file compiled for input 
for the classifier can act as a valuable secondary data 

source. 

Lastly this feature file results in a measurement space 
that does not require normalisation. The values on each of 
the 12f5 variables are ricasitred in the same units, and the 


variables are directly scaled by their areal importance. 


Result, s 

Three sets of images were used to test the performance 
of the algorithms, two of California, and one of Sastern England. 


(Table 2). The L.iagcs of Central California ch'ov; a very 


clear regicr.:.lisi‘t.*.cn into r. 

s-'ri 

cc of parallel 

4 1 1 • 

trending acneo, ..'..ic.i :J nt' ia 


e a repotj»taOi^ 

of ou;.ic a-o..os 

cither cid..’ cf ‘*»e arri gated 

V e 

, in th: centra: 

L Valley. The 

tv.'O images are of the ai 

•ca. 

separated by a 

period, of 


S%- 



9 Months, ilic first being taken in August '72, and the second 
in May *73. The contrast bet^/ecn the ii.iages higlilights 
the effects that seasonality way have on the imagery. 

The image of Eastern England is considcrr.bly less clear, 
due to the presence of cloud in the ’.Vest of the image, and 
a very large amount of atmospheric haze. This image 
represents a much more severe test of -the classifiers* 

The performance of the three algoritiims in terns of 
their consumption of computing resources is shown in Table 3. 

As can be seen the three algorithas differ greatly in their 
requirements of computing time, although it was found that 
these tine requirements did not vary significantly v;ith the data 
sets used. Several points can be made in the light of this 
table. 


Firstly it is not possible to use the hierarchical technique 
for anytlung other than the largest fields, since the requirements 
of core space become prohibitively large. Secondly the 
iterative technique becomes very expensive to operate in 
terns of the consumption of time, with large numbers of 
points, and that it is not possible to use this algoritlin 
for routine application to the smaJl 50 x SO fields. 

Although it was not possible to devise an objective 
criterion of the accuracy of the classifications, it was 
possible to give a subjective evaluation in terns of the 
interpretability and spatial coherence of the classes, vdsich 
were in turn related to our a priori luiowledgc of the gegional 
geographies of the two areas, lu ail cases it was seen that 

the 'best' classifier was always the expensive. 


that is the hierarchical technique al-.;ay,5 produced better 

clDEses than the hierarchical, which .;as in turn always better 

;JO'7?ver the dif f croucos between tiic 
rQ- ^ 


tl’.nn th’’; iterative. 






iterative and chain techniques when applied to the large 
data sot of 2300 fields are small, and indicate that in view 
of the computational economy it offers, the chain technique 
shot4d be used. Thus in defining the size of fields to be 
used, the investigator in effect determines the algorithm 
that is appropriate. 

The problem of the choice of scale is however difficult, 
for it is possible to observe that altering the field size 
Is equivalent to reducing the resolution of the classification 
scheme, and we find that in altering the resolution ms are 
not only able to locate the same feature more accurately 
but also to sec smaller fcattires. This change is most marked 
if we compare the classifications using the large 200 x 200 fields 
and the small 50 x 50 fields for the images of California. The 
classification using the large fields for this reproduces 
almost exactly the broad zonations that v;e see in the Image. 
Kowevor, the low resolution of the system means that 
boundaries of these zones are located only crudely. Moving 
to smaller fields however does not just reproduce the same 
classification with better definition, but produces r. 

^ q.^^<^ffcrfint kind of classification. The features ti;at form 
part of the broad classes defined using the large fields 
become individual classes in their own right, so that a far 
more complex pattern arises. Thtjs by reducing the scale of 
the input, the classifiers respond by changing the scale of 
the cl Losses produced. 

This prcblcr. of field sine is not independent of the 
j*::-;. u: -U h. ve ' thil tho 2':'0 :: ICO fields preferably 

cla:.si'::-'d using Ihe hie'T.rchical technique, pro'^ucc very 
satisfactory classif icatior. rf Califcn-.i.' . The same seke .e 
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however when applied to the ir.ace of Jiastern P.:i[;land Is 
considerably less successful, and it is possible to interpret 
only three groups froiu the classifier; clear sea, turbid 
water, and land. Moving to a sraaller field size greatly, 
improves the performance of the classifier, which then starts 
to produce coherent and interpretable classes in the land. 
However even at this snialler scale the algorithms do not 
function very well with this poor quality image. 

It is possible to conclude that the choice of an 
appropriate scale depends not only upon the required resolution 
of the classified map, but also upon the nature of the under- 
lying terrain. Vfhere the scale of the underlying spatial 
Clements is small, as with the English imagery, small fields 
are indicated, bxtt where the landscape is made up of larger 
imits, so larger fields can be used. 
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Table 1 




Fields Used study 


1 

Size of Fields 
in Pixels 

Horizontal 
Size (iCm) 

Vertical 
Size (Km) 

Kurftcr in an 
laiTS scene 

50 X 50 

2.S 

3.7 

2806 

100 X 100 

5.7 

7.4 

704 

200 X 200 

11.4 

14,8 

176 


Table 2 



■ 

Images discussed 


Local code 

BRTS frame 

Date 

Area 

Iiaage D 
Image E 
liiage C 

1056-18114 
1308-18122 
1031- *10334 

17 sept 1072 
27 Kay 1973 
23 August 1973 

central California 

II •' 

Eastern England 



Table 3 



Performance of the Algorithms 


• 

Algorithr.i 

» 

• 

Humber Points 

Core 

CP Time 

Tape 

. I/O time 

Hierarchical 

176 

190 

31 8 

15 s 

Iterative 

176 

60 

60 9 

15 s 


704 

76 

4 ra 

24 9 


2806 

114 

19 M 

IS m 

Chain 

176 

60 

15 8 

10 8 


704 

80 

20 s 

24 s 


2C06 

. 9S 

60 s 

30 s 


Note; 

Core rr.er.5urecl in Kbytes (i«e* v;ords x 4 x 1000) 
iiachino used ’.ras IBM 370/165 

The tir.es ace total airounts of ti;r.e used, v;hich are considerably less 
than the total real tii^c that the Jobs are in the nachine* 
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Abstract 


The value of repeated iMgary free oatemte platforu, can ont) 
realieed if coupled ,o rapid upping e,e,e„,. 

tepid, user oriented mps fron UNDSAT data arc discussed, and eaanp 
derived fron the mapping of the coastal and off-shore areas of caste 


Rngland. 
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1« Introduction: 

Conventional surveying and cartographic techniques arc usually 
.'Considered cxccss^.vely expencive and far too slov/ for the iionJ toeing 
■ pf rapidly changing areas* It is the a.iit of this p.apcr to cho\f how 
remotely sensed data can provide U*e hasis for cheap rapid cartography^ 
which can be tailored to suit a potential user's needs* Remotely sensed 
data, particularly from the L/dJnSAT series of catcUitcr. (previously 
known as BRTS) has proved its utility over large areas of the world, and 
particularly in the United States (for example papers see Preden, Kercanti 
& Becker, 3) However its utility in the British context is less 

clear, since excellent cartographic coverage of most environmental features 
already exists, and the need for repeated coverage with a spatial resolution 
of only 80 m is questionable. Vfo will attempt to show here that some very 
siihple digital techniques are capable of providing rapid, useful cartography 
of changing areas* The example we consider is tluit of mapping the changing 
sand banks and suspended sediment distributions in coastal areas, providing 
maps that are relevant to navigation, sedimentology, recreational planning 
and coastal protection. The major advantage of this method l £ mapping is 
the speed with which it can produce a specialist map of an area, although 
in doing so it may sacrifice some of the absolute cartographic standards 

I 

that are normally expected. The maps can be produced at an extremely low 
cost and the system will allow regular re-napping at frequent intervals. 

2* Data Characteristics: 

The specifications of the LAJoSAT data collection system has been 

7 

described in detail in several sources so only a brief summary will be given 

here* 

The tw satellites are Jn near polar, sun-aynchronous orbiJsat a height 
of 950 km, and provides the potential for covering the earth every 19 days. 
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The data is collected by two sensing systems^ the llcturn Bean Vidicon camera 

(RBV) and the Multi-Spectral Scanner (MSS). For tccnnical reasons, the data 

1 

has been restricted to that col'lected from the MSS in most cases, and that 
is the sole data source that will be considered here, multispcctral 

scanner is a device for recording the amount of radiation reflected from 
*'a small area of the earth’s surface. Tlic optics of the sensor focus 
on an area about 80 m square and the amount of light returning from this 
area is recorded as a voltage in sensors which operate in four spectral 
bands (see Table 1). A mirror which oscillates over a small angle (-2.8 ) 
the position of the Ground Resolution (GRE) being sensed by the 
element system, scanning a strip across the earth beneath the satellite. 

The responses of the sensors are sampled at regular intervals, corresponding 
to a valxie for every 56 m of the ground. The forward motion of the sensor 
platform moves the scanner forward so that successive scan lines are located 
parallel to one another. In order to ensure complete coverage of the surface 
the MSS senses six sucii strips simultaneously. These signals arc digitized 
to a 64 value scale, and telemetered to a receiving station on the earth, 
wiicrc this raw data string is computer processed by NASA, The continuous 
data stream is split into separate scenes for convenience. The data is 
available in two forms, images and bulk processed digital data on computer 
compatible tapes (CCTs) . 

Images can be created from the data stream by a procedure which is 
lesscntially the reverse of the scanning. A fine light beam is moved in a 
regular scanning raster corresponding to the original scan of the sensor and 


its intensity 
satellite bas 
as 6n T7 ic* ;' 
Such rn ir.aivi 


is varied in the same v;ay as the intensity recorded by the 
cd sensor. This produces an image in much the same way 

n ^ ^ 'lo r ’ ' an tr*n t co*^y can jircf'Ticod photographically, 

has most of the qualities of an ait photograph. 
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.The dicital data source is: subject, at this stage, to certain corrections. 
Theie is a slight variation in the speed of the scan as generated by the 
mirror, so it becomes necessary to ajust the number of data points in each 
scan line to keep these constant for a given scene (usually 3240 points 
pci line). Secondly the variations in sensor response are removed by 
reference to a constant calibration source. Thirdly the digital values in 
bands 4, 5 and 6 are "stretched** from their original 64 unit scale to a 
128 unit scale by a non-linear transform which gives a linear relationship 
between the recorded value (grey scale) and the absolute amount of radiation 
sensed. 

I 

The major geometric distortion in the imagery is caused by the 
earth's rotation beneath the satellite. Although the satellite maintains 
a constant path, the resultant scan lines are displaced to the west as the 
satellite continues its forward notion. Although a mors complete analysis 
of the geometry of the data includes several further corrections^ to a 
first approximation, we can represent the data for a single scene as a 
parallelogram, with 3240 pixels (picture elements) across each scan and 
with 2340 lines to each scan, covering an area 185 x 185 km, (100 x 100 
nautical miles). There are more than 7 million pixels in a single image, 
each one covering an area 79 x 56 m, with the values of the reflectance 
in four spectral bands recorded at each of these points. In view of the 
size of the data set (each scene of 7 million data points is collected 
in 25 seconds) the need for computer handling of the data is obvious, if even 
a small fraction of the information is to be used. 

3. Basis of Computer Mappings 

The basis of computer mapping is the ability to characterise the response 
of a category of surface features in spectral terns. There are several 
ways of approaching this problem, the three most important being;— 

preceding page BLAN^ filmed ^ iq 


<i) an adjhoc Approach which arbitarily splits 




dimensional rcsponse-space into groups. 

«i) Bay«,la„ pattern rcco.p.itio„ tec,.„i,„ea which Identify area, 
of each ereund type, and then uae Ihoae as trainin. areas to develop 
otatlatlcs Which are used to classify all additional points. 

ail) Onsupervlsed classifications, which groups all s«i.n ..ata 
pornts together so that they can he a,no^„^ ,,,,,,, 
types. 

Of these methods the second t, 

second has been used by the LABsk packaee^ and 

the third by Ka.y and Xolaba«. ihe first technique Is by far the simplest, 

nd has been used by nany worher. Including for example Pavld, Perles h 
Verger , 

It is our experience that the 

Ptocedure yields the fastest and nost accurate napping teCu.loue for coasta, 

;;a, and furthernore has the advantage of .reat flexibility, and nminal 

~nts on conputer tine and core space. Xhe alternatives nay be nore 

bant, and offer the possibility of transfer to other data sets, but 

thU advantage Is bought at t,„ expense of greater conplexlty. where the 

o sect as to produce rapid, cheap cartography, the first of these scans to 
be the obvious choice. 

conv.nicnt starling point is to look at the intensity levels in the 
spectral hands over the re.len of Interest. y„ the present stndy a 
"d"Ber of scan lines across the ....ere exan.-.e,, and u e .. , ..fe^e r„ 
Identify areas of land, v.t darh sand, dry hrlsht sand, .nhaUe,: r..rtcr 

Clear water, fron sane h„owled,e of the spectral response of’ these surface- 

typCaS anti thoir position alon- tl. > - » j r 

n line, J or example very low vain 

e i.-coa.cd wita t.-atoi, v.hicJi has a low reflectance in thiV. 

P-rt of the apoc.run. High values of reflectance in band . over the l).ter 
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ncaiminn scales, and to displace every twcnxicTn iiiic unc — — -- 

rcct.s for skew. Tliis removes all but 3% of the vertical scale distortion and all but 5% 
of the skew, ('iving a map with a scale of very nearly 1:22,000 (to be exact 0*17 or 

about 3 inches to 1 mile). Although the resolution of the map is low, objects and bound- 

. • I 

aries can be positioned relative to each other within about 100 m and every data point 
(excluding a very small number of spurious points arising from electronic noise) represents 
a "real’* piece of information about th^ location and is not based on interpolation as is 
often the case in map production. Reducing the output dimensions by a factor of ten gives 
a scale of nearly 1;230,000 and a map of potentially great utility. 

4 Results 

The results present here ate two maps showing the hash, and tlie Humber Estuary. The 
data was derived from UNDSAT image F.RTS-1031-10334, taken on 23rd August, 1974 at 9.40 an. 
T3iese maps were originally output into a line printer and hand coloured. Since these 
original outputs were some 4 m in area, photo-reduction was essential. 

For the purposes of this paper, the output has also been agreed by a graph plotter, 

to produce a black and white map. Tlic graph plotter has the potential of bing able to accept 

\ 

considerably more sophisticated geometric corrections, a continuous range of scaling, and 
a larger range of symbols, so for the production of high quality maps production of high 
quality maps such a system would be preferable. David et al.^ have in fact implemented 
such a system using a plotter with six different coloured pens. However it is our experience 
that plotter systems are slow and expensive compared to the line printer, and that they arc 
best used only for production of final I’.aps, and then only if economically justified. 

The accuracy of the resulting maps is difficult to assess quantitatively, since they 
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were collected in the absence of ground truth, and it is possible only to make some quali- 
tative statements. The presence of sea fog off the North Norfolk coast prevents the es ten- 
sion of the map into that area. This demonstrates one of the main restrictions of remotely 
sensed data, that they arc affected storngly by atmospheric conditions (except for such 
systems as $LAR) . Another error has been a relatively 3iign rate of •iis-classif ica ti on 
of the land. In particular the confusion between urban areas and mud flats surh that no:. ^ 
of the urban area of Mult h 05 ; bern clar, n ifird as nulfiat. This is due tc Jic io ■ c . . 
of green vegetation in both which r-sult in low 7R reflectances in l.oth re.-inrs. 
for the present taik of napping coastal areas thi-. 3ms not proved I rouble- '.i c , Am on. -■ 
t3ic roastling has been c .‘>tab1 isnod ilie rii?ct".r«-- i fical ion of fc^ turer. 3:c hind 
tlic shc'c Jii'.P is of no ini.'’:'i':-.t . V;if, .ici u:- i y 1 t ii e j e * o * i-' 
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seems to be associated with -areas of turbid vr.x ter. lii^ure 1 shows a typical 
scan line and the surface catccorics into which various spectral classes 
were assigned. This information was used to construct a decision tree 

(Fig, 2) and every point in the area could be classified using the sequence 
of decisions* 

Once the decision tree has been established it is a simple 
matter to read a scan line at a time, and then to classify each of the 
points along the scan line. If the output of the single line is sent 
straight to a plotting device at this stage there is no need to store the 
classified image and it is thus possible to use a classifier of this type 
on a computer with a very small amount of .core (less than 20K core words). 

However, the nature of the output presents a problem. The fastest and 
easiest technique is to use the computer line printer and this is the device 
that has been used most frequently in the present work. Alternative plotting 
devices such as multicolour p 1 o t t e rs as used by David^, or the colour 
TV screens used by various American ivorkers appear to offer much better 
quality but at an increased cost. 

Using the line printer as the main graphic output device, we wrote 
out a symbol that corresponded to a surface type at each point along the scan 
line. The restriction on the size of a line that can be output on the 
line printer means that it is necessary to reconstruct the image from several | 
sheets of line printer output. Furthermore, the gconctrics of such I 

, output and that of the scann-ir area are not identical. The spacing of the I 

text symbols is ten to the inch in a hori'.ontal direction and six to the inch | 

1 

in a vortical direction, which does not correspond to the 79 x 56 m size j 

of the original pixels and no allowance at ail is made for skew. To 1 

correct for these two effects, it has been found simplest to omit every ! 

sixth line of data^ v.tiich allov.'s for the differences in line printer and | 

t 
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classification is less easy to establish. The classification that ivas 
orieinally established for the Wash was used without modification on the 
Humber which comes from a quite different part of the same scene. In 
particular it is observed that nearly all the shallow water in the Humber 
is classified as mud, and the clear, but still turbid water in the deep 
clmnnel is classified as shallow vfatcr. Ibis implies that in this area 
the water is reflcctir\5morc lj.(jht m all channels, I'jost sitjnificantly in 
the infra-red. TIils is probably due to the higher, turbidity of the 
Humber compared with the waters of the Wash. Although the labels on a 
particular class might be incorrect, their spatial arrangenent makes good 
sense in terms of the known mech.anics of the river. It would be a 
simple matter to device a modified decision tree for the Humber and 
produce a map which classified the areas in a more satisfactory manner. 

An inaccuracy induced by the nature of the scanning system itself, 

' arises from the fact that the six scanners which scan across the image in 
one spectral band do not have exactly the same response , This effect has 
been partially removed during the production of the computer compatible 
tapes by NASA, but some still remains. (Purther corrections have been 
introdu cd by NASA subsequent to the production of the data set used in 
this example.) The effect of this is to introduce a streaky pattern of 


noise on the map, which has a tendency to repeat itself every six lines, 
and is particularly evident in divisions bctv;ccn shallov/, turbid and clear 
water, where the classification involves the arbitary division of a continu- 
ous scale into three clas.‘->es. The noise level is often two grey scale units 
and where classification boundaries arc only three units apart the noise 


can prove a very notivcable feature. This noise can be reno%'od by a simple 
filtering process v.-hirh multiplies the values on eacli line by a constant 


correction factor. 


I 


. .. 




where c.^ is the mc^n rvalue of the reflectance in band i and Mie j is th, 
ilnc mimbcr modulo 6.' llowcyer the correction has not been applied to 

the image used here. 

The detail that can be seen in the map is demonstrated by fig. 4 
tvhich shows a detail from ^ map of the Hunstanton area. Here we see a 
feature Classified as a mud bank running at about 45^ to the coastline, which 
could easily be considered a random classification. However detailed 
examination of air photographs and a field ^heck revealed that the 
feature is a mussel bed some 80 m across. In this instance the 
feature is very near the resolution limit of the scanner optics, yet it 
has been mapped and classified. ' Tv.'o types of beach natcrial were also 
evident, the darker wet sand near the sea edge being easily distinguishable 
from the light coloured acolian sand further inland. However these features 
arc so small that it is virtually impossible to identify then on the photo- 
graphic images from the satellite data. Th-jsto extend UMTOSAT data to^vards 
its resolution limit it is necessary to use computer compatible tapes rather 
than photography. The tapes have a two fold advantage; not only do they 
allow examination of cash individual pixel, but also to quantify inforr.ation 
about the spectral properties of the pixel. 

5, Discussion; • 

It is possible to produce rapid and relatively accurate raps of coastal 
areas. It is our belief that now the data handling routines have been 
established it would be possible to produce a map from a gi\cn tape of 
LAHDSAT data of specified classes, within about a week (i.c. total of 
six jobs on the computer). V.-hcrcas this technique is not capable of initial 
napping of previously unr;npped areas to within acceptable standards, it is 
quite capable of produc-’r.- 1 ’ enatic raps of specific surface types which 
can be controlled in a'usolute location by reference to other more conventional 



cartography. The advantage of this technique in not its locational accuracy 

or fine reaolution, but its ability to produce a spocialint nap to 

specifications desired by the user. Such maps would be ideally suited 

for resource inventories and the nonitoring of rapidly changing areas. 

Possible the main limitation of the method is that the naps are essentially 

maps of spectral classes and the greatest difficulty arises in assigning 

a spectral class unambiguously a surface type. Thus it is necessary for 

the user to be aware of the limitations of the classification system, or 

where the maps are to be, used by non-specialists, that spurious data 

points and areas of mis-classif ication be cleaned up by hand or further 

computer routines. With the construction of a receiving station • at Fucino, 

Italy, it is potentially possible to obtain coverage of a given area of 

the British Isles every 18 days and tluis, within the limitations of cloud 

! 

conditions, regular nonitoring by the methods described is eminently 
feasible. • 
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The banfln of the LAlIDlrJAT I.Iulticpectral Scar-.-^cr (Tlljr;) 
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'■i’he Ihinatcntou area r.r; classified by the 
ruKl by visual inspection OT air nhotoi^raphs. 
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